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Acyl CoArChoIesterol Acyltransferase Related Nucleic Acid Sequences 

5 INTRODUCTION 

This application claims the benefit of U.S. Provisional Application Number 
60/088,143 filed June 5, 1998, and U.S. Provisional Application Number 60/108,389 filed 
November 12, 1998. 
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Technical Field 

The present invention is directed to nucleic acid and amino acid sequences and 
constructs, and methods related thereto. 

15 Background 

Through the development of plant genetic engineering techniques, it is now 
possible to produce a transgenic variety of plant species to provide plants which have 
novel and desirable characteristics. For example, it is now possible to genetically engineer 
plants for tolerance to environmental stresses, such as resistance to pathogens and 
2 0 tolerance to herbicides and to improve the quality characteristics of the plant, for example , 
improved fatty acid compositions. However, the number of useful nucleotide sequences for 
the engineering of such characteristics is thus far limited and the speed with which new 
useful nucleotide sequences for engineering new characteristics is slow. 

There is a need for improved means to obtain or manipulate compositions of sterols 

2 5 and fatty acids from biosynthetic or natural plant sources. For example, novel oil products, 

improved sources of synthetic triacylglycerols (triglycerides), alternative sources of 
commercial oils, such as tropical oils (/.£., palm kernel and coconut oils), and plant oils 
found in trace amounts from natural sources are desired for a variety of industrial and food 
uses. Or, the ability to increase sterol production in plants may provide for novel sources 

3 0 of sterols for use in human and animal nutrition. 

To this end, the triacylglycerol (TAG) biosynthesis system and sterol biosynthesis 
in mammalian tissues, yeast and plants has been studied. 
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Sterol biosynthesis branches from the farnesyl diphosphate intermediate in the 
isoprenoid pathway. Sterol biosynthesis occurs via amevalonate dependent pathway in 
mammals and higher plants (Goodwin,(1981) Biosynthesis of Isoprenoid Compounds, vol 
1 (Porter, J.W. & Spurgeon, S.L., eds) pp.443-480, John Wiley and Sons, New York), 
5 while in green algae sterol biosynthesis is thought to occur via a mevalonate independent 
pathway (Schwender, et aL (1997) Physiology, Biochemistty, and Molecular Biology of 
Plant Lipids^ (Williams, J.P., Khan, M.U., and Lem, N.W., eds) pp. 180-182, Kluwer 
Academic Publishers. Norwell, MA). 

The solubility characteristics of steroyl esters suggests that this is the storage form 

10 of sterols (Chang, et aL, (1997) Annu. Rev, Biochem., 66:613-638). Sterol O- 

acyltransferase enzymes such as acyl CoAxholesterol acyltransferase (ACAT) catalyze 
the formation of cholesterol esters, and thus are key to controlling the intracellular 
cholesterol storage. 

Such ACATs have been the subject of many research efforts, particularly for 

15 applications involving the reduction of cellular cholesterol storage in humans. Several 
studies suggest that cholesterol esters contribute significantly to the early formation of 
foam cells in atherosclerosis in humans (Fowler, et al (1979) Lab. Invest. 41:372-378; 
Schaffner et aL (1980) Am. J. Pathol 100:57-80; Lupu, et aL (1987) Arterosclerosis 
67:127-142; Brown et aL (1983) Annu. Rev. Biochem. 52:223-261; the entirety of which 

2 0 are incorporated herein by reference)and by blocking ACAT, intracellular cholesterol 

esters are significantly reduced (Ross,ef aL (1986) J. Biol Chem. 259:815-819; Tabas. et 
aL (1986) J. BioL Chem. 261:3147-3155; Cadigan, et al (1988)7. Lipid Res. 29:1683- 
1692; Bocan et al. (1991) Arte rioscler. Thromb. 1 1: 1830-1843, the entirety of which are 
incorporated herein by reference). Thus, directly inhibiting ACAT within the arterial wall 

2 5 may inhibit the progression of atherosclerotic lesions without lowering total plasma 

cholesterol. 

TAG biosynthesis occurs in the cytoplasmic membranes of plant seed tissues 
which accumulate storage triglycerides ("oil"), fatty acyl groups are added sequentially by 
specific acyltransferase enzymes to thesrz-l, sn-2 and .y/2-3 positions of glycerol-3- 

3 0 phosphate (G3P) to form TAG. This pathway is commonly referred to as the Kennedy or 

G3P pathway. 

The first step in TAG formation is the acylation of the sn-l position of glycero- 
phosphate, catalyzed by glycerophosphate acyltransferase. to form lysophosphatidic acid. 
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The lysophosphatidic acid is subsequently acylated at the sn-2 position by 
lysophosphatidic acid acyltransferase (LPAAT) to create phosphatidic acid. The 
phosphatidic acid is subsequently dephosphorylated at the sn-3 position by phosphatidic 
acid phosphatase to form s/z-l,2-diacylglycerol (DAG). 

An important step in the formation of TAG is the acylation of the sn-3 position of 
sn-l,2-diacylglycerol by diacylglycerol acyltransferase (DAG AT, EC 2.3.1.20) ultimately 
forming triacylglycerol (TAG). 

The characterization of diacylglycerol acyltransferase (also known as DAGAT) 
and acyl CoA:cholesterol acyltransferase (also known as ACAT) is useful for the further 
study of plant fatty acid and sterol synthesis systems and for the development of novel 
and/or alternative sterol and oils sources. Furthermore, identification of novel ACAT 
sequences may provide a novel means to inhibit intracellular cholesterol ester formation 
in animals, thus reducing atherosclerosis. Studies of plant mechanisms may provide 
means to further enhance, control, modify, or otherwise alter the total fatty acyl 
composition of triglycerides and oils. Furthermore, the elucidation of the factor(s) critical 
to the natural production of triglycerides in plants is desired, including the purification of 
such factors and the characterization of element(s) and/or cofactors which enhance the 
efficiency of the system. Of particular interest are the nucleic acid sequences of genes 
encoding proteins which may be useful for applications in genetic engineering. 



SUMMARY OF THE INVENTION 

The present invention is directed to acyl-CoA:cholesterol acyltansferase (EC 
2.3.1.26, also referred to herein as ACAT) related polynucleotides, and in particular to 
ACAT-related polynucleotides. The polynucleotides of the present invention include 
those derived from plant, fungal, mammalian and nematode sources. 

Thus, one aspect of the present invention relates to isolated polynucleotide 
sequences encoding ACAT related proteins. In particular, isolated nucleic acid 
sequences encoding ACAT related proteins from human, rat, C elegans. Arabidopsis, 
soybean and corn are provided. 

Another aspect of the present invention relates to oligonucleotides which include 
partial or complete ACAT encoding sequences. 
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It is also an aspect of the present invention to provide recombinant DNA 
constructs which can be used for transcription or transcription and translation (expression) 
of ACAT. In particular, constructs are provided which are capable of transcription or 
transcription and translation in host cells. 

In another aspect of the present invention, methods are provided for production of 
ACAT in a host cell or progeny thereof. In particular, host cells are transformed or 
transfected with a DNA construct which can be used for transcription or transcription and 
translation of ACAT. The recombinant cells which contain ACAT are also part of the 
present invention. 

In a further aspect, the present invention relates to methods of using 
polynucleotide and polypeptide sequences to modify the sterol, sterol ester, and fatty acid 
content as well as composition, particularly in seed tissue of oilseed crops. Plant cells 
having such a modified sterol and fatty acid content are also contemplated herein. 

In yet a further aspect, the present invention relates to methods of using 
polynucleotide and polypeptide sequences to inhibit or delay the germination of seeds. 

The modified plants, seeds and oils obtained by the expression of the ACAT-like 
proteins are also considered pan of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is the encoding sequence to theArabadopsis thaliana ACAT-like protein 
(SEQ ID NO:l). 

Figure 2 are soy ESTs related to the class of ACAT-like proteins (SEQ ID NOs:3- 

6). 

Figure 3 are maize ESTs related to the class of ACAT-like proteins (SEQ ID 
NOs:7-10). 

Figure 4 is a Mortierella ESTs related to the class of ACAT-like proteins (SEQ ID 
NO: 11). 

Figure 5 is an encoding sequence to a mouse protein related to the class of ACAT- 
like proteins (SEQ ID NO: 12). 

Figure 6 is another encoding sequence to a mouse protein related to the class ot 
ACAT-like proteins (SEQ ID NO: 13). 
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Figure 7 is an encoding sequence to a human protein related to the class of ACAT- 
like proteins (SEQ ID NO: 14). 

Figure 8 is an encoding sequence to a rat protein related to the class of ACAT-like 
proteins obtained by assemblage of single pass 5' terminal sequences of different cDN As 
5 (SEQ ID NO: 15). 

Figure 9 is a sequence comparison of various ACAT related amino acid 
sequences. Figure 9A shows aphylogenetic tree showing the relationship between the 
ACAT protein sequences. Figure 9B provides a table showing the percent similarities and 
percent divergence of the known ACAT protein sequences and the novel ACAT-like 
10 sequences. 

Figure 10 provides the coding sequence of the Rat ACAT-like nucleic acid 
sequence derived from a single full length cDNA clone (SEQ ID NO: 16). 

Figure 1 1 provides the amino acid sequence coded by the Rat ACAT-like DNA 
sequence of figure 10 (SEQ ID NO: 17). 
15 Figure 12 provides the amino acid sequence of the ACAT-like protein obtained 

from Caenorhabditis elegans (SEQ ID NO: 18). 

Figure 13 provides a partial nucleic acid sequence of the C. elegans ACAT-like 
protein (SEQ ID NO: 19). 

Figure 14 provides a different partial nucleic acid sequence for the C. elegans 
2 0 ACAT-like protein (SEQ ID NO:20). 

Figure 15 provides a schematic diagram of the binary vector for soybean 
transformation, pCGN88 17. 

Figure 16 provides a schematic diagram of the binary vector for transformation of 
corn, pCGN88I8. 

25 Figure 17 provides an amino acid sequence comparison between the novel rat and 

Arabidopsis ACAT-like proteins and the known human and mouse ACAT protein 
sequences. 

Figure 18 provides the results of the expression of the rat ACAT-like sequence in 
E. coli using thin layer chromatography (TLC). 

30 
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The present invention relates to acyl CoA: cholesterol acyltransferase (hereinafter 
referred to as ACAT) related sequences, particularly the isolated ACAT nucleic acid 
sequences encoding the ACAT protein from host cell sources. A acyl CoA: cholesterol 
acyltransferase related sequences of this invention includes any nucleic acid sequence 
encoding amino acids from a source, such as a protein, polypeptide or peptide, obtainable 
from a cell source, which demonstrate the ability to form sterol esters from sterol and fatty 
acid substrates. By "enzyme reactive conditions" is meant that any necessary conditions 
are available in an environment (i.e., such factors as temperature, pH, lack of inhibiting 
substances) which will permit the enzyme to function. 

Isolated Polynucleotides, Proteins, and Polypeptides 

A first aspect of the present invention relates to isolated ACAT polynucleotides. 
The polynucleotide sequences of the present invention include isolated polynucleotides 
that encode the polypeptides of the invention having a deduced amino acid sequence 
selected from the group of sequences set forth in the Sequence Listing and to other 
polynucleotide sequences closely related to such sequences and variants thereof. 

The invention provides a polynucleotide sequence identical over its entire length 
to each coding sequence as set forth in the Sequence Listing. The invention also provides 
the coding sequence for the mature polypeptide or a fragment thereof, as well as the 
coding sequence for the mature polypeptide or a fragment thereof in a reading frame with 
other coding sequences, such as those encoding a leader or secretory sequence, a pre-, 
pro-, or prepro- protein sequence. The polynucleotide can also include non-coding 
sequences, including for example, but not limited to, non-coding 5' and 3' sequences, 
such as the transcribed, untranslated sequences, termination signals, ribosome binding 
sites, sequences that stabilize mRNA, introns. polyadenylation signals, and additional 
coding sequence that encodes additional amino acids. For example, a marker sequence 
can be included to facilitate the purification of the fused polypeptide. Polynucleotides of 
the present invention also include polynucleotides comprising a structural gene and the 
naturally associated sequences that control gene expression. 

The invention also includes polynucleotides of the formula: 
X-(Ri) n -(R 2 )-(R3)n-Y 
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wherein, at the 5' end, X is hydrogen, and at the 3' end, Y is hydrogen or a metal, R| and 
R ? are any nucleic acid residue, n is an integer between 1 and 3000, preferably between 1 
and 1000 and R2 is a nucleic acid sequence of the invention, particularly a nucleic acid 
sequence selected from the group set forth in the Sequence Listing and preferably SEQ ID 
5 NOs: 1 and 16. In the formula, R 2 is oriented so that its 5' end residue is at the left, bound 
to Rj, and its 3' end residue is at the right, bound to R3. Any stretch of nucleic acid 
residues denoted by either R group, where R is greater than 1, may be either a 
heteropolymer or a homopolymer, preferably aheteropolymer. 

The invention also relates to variants of the polynucleotides described herein that 

10 encode for variants of the polypeptides of the invention. Variants that are fragments of 
the polynucleotides of the invention can be used to synthesize full-length polynucleotides 
of the invention. Preferred embodiments are polynucleotides encoding polypeptide 
variants wherein 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues of a polypeptide 
sequence of the invention are substituted, added or deleted, in any combination. 

15 Particularly preferred are substitutions, additions, and deletions that are silent such that 
they do not alter the properties or activities of the polynucleotide or polypeptide. 

Further preferred embodiments of the invention that are at least 50%, 60%, or 70% 
identical over their entire length to a polynucleotide encoding a polypeptide of the 
invention, and polynucleotides that are complementary to such polynucleotides. More 

2 0 preferable are polynucleotides that comprise a region that is at least 80% identical over its 
entire length to a polynucleotide encoding a polypeptide of the invention and 
polynucleotides that are complementary thereto. In this regard, polynucleotides at least 
90% identical over their entire length are particularly preferred, those at least 95% 
identical are especially preferred. Further, those with at least 97% identity are highly 

2 5 preferred and those with at least 98% and 99% identity are particularly highly preferred, 

with those at least 99% being the most highly preferred. 

Preferred embodiments are polynucleotides that encode polypeptides that retain 
substantially the same biological function or activity as the mature polypeptides encoded 
by the polynucleotides set forth in the Sequence Listing. 

3 0 The invention further relates to polynucleotides that hybridize to the above- 

described sequences. In particular, the invention relates to polynucleotides that hybridize 
under stringent conditions to the above-described polynucleotides. As used herein, the 
terms "stringent conditions'' and ^stringent hybridization conditions" mean that 
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hybridization will generally occur if there is at least 95% and preferably at least 97% 
identity between the sequences. An example of stringent hybridization conditions is 
overnight incubation at 42°C in a solution comprising 50% formamide, 5x SSC (150 mM 
NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's 
solution, 10% dextran sulfate, and 20 micrograms/milliliter denatured, sheared salmon 
sperm DNA, followed by washing the hybridization support in 0. 1 x SSC at approximately 
65°C. Other hybridization and wash conditions are well known and are exemplified in 
Sambrook, et al, Molecular Cloning: A Laboratory Manual, Second Edition, cold Spring 
Harbor, NY (1989), particularly Chapter 11. 

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence set for in the Sequence Listing under 
stringent hybridization conditions with a probe having the sequence of said 
polynucleotide sequence or a fragment thereof; and isolating said polynucleotide 
sequence. Fragments useful for obtaining such a polynucleotide include, for example, 
probes and primers as described herein. 

As discussed herein regarding polynucleotide assays of the invention, for example, 
polynucleotides of the invention can be used as a hybridization probe for RNA, cDNA, or 
genomic DNA to isolate full length cDNAs or genomic clones encoding a polypeptide 
and to isolate cDNA or genomic clones of other genes that have a high sequence 
similarity to a polynucleotide set forth in the Sequence Listing. Such probes will 
generally comprise at least 15 bases. Preferably such probes will have at least 30 bases 
and can have at least 50 bases. Particularly preferred probes will have between 30 bases 
and 50 bases, inclusive. 

The coding region of each gene that comprises or is comprised by a polynucleotide 
sequence set forth in the Sequence Listing may be isolated by screening using a DNA 
sequence provided in the Sequence Listing to synthesize an oligonucleotide probe. A 
labeled oligonucleotide having a sequence complementary to that of a gene of the 
invention is then used to screen a library of cDNA, genomic DNA or mRNA to identify 
members of the library which hybridize to the probe. For example, synthetic 
oligonucleotides are prepared which correspond to the ACAT EST sequences. The 
oligonucleotides are used as primers in polymerase chain reaction (PCR) techniques to 



8 



WO 99/63096 PCT/US99/12541 



obtain 5* and 3' terminal sequence of AC AT genes. Alternatively, where 
oligonucleotides of low degeneracy can be prepared from particular ACAT peptides, such 
probes may be used directly to screen gene libraries for ACAT gene sequences. In 
particular, screening of cDNA libraries in phage vectors is useful in such methods due to 
5 lower levels of background hybridization. 

Typically, a ACAT sequence obtainable from the use of nucleic acid probes will 
show 60-70% sequence identity between the target ACAT sequence and the encoding 
sequence used as a probe. However, lengthy sequences with as little as 50-60% sequence 
identity may also be obtained. The nucleic acid probes may be a lengthy fragment of the 

10 nucleic acid sequence, or may also be a shorter, oligonucleotide probe. When longer 
nucleic acid fragments are employed as probes (greater than about 100 bp), one may 
screen at lower stringencies in order to obtain sequences from the target sample which 
have 20-50% deviation (i.e., 50-80% sequence homology) from the sequences used as 
probe. Oligonucleotide probes can be considerably shorter than the entire nucleic acid 

15 sequence encoding an ACAT enzyme, but should be at least about 10, preferably at least 
about 15, and more preferably at least about 20 nucleotides. A higher degree of sequence 
identity is desired when shorter regions are used as opposed to longer regions. It may thus 
be desirable to identify regions of highly conserved amino acid sequence to design 
oligonucleotide probes for detecting and recovering other related ACAT genes. Shorter 

2 0 probes are often particularly useful for polymerase chain reactions (PCR), especially when 

highly conserved sequences can be identified. (See. Gould, et aL, PNAS USA ( 1989) 
86: 1934- 1938.). 

Another aspect of the present invention relates to ACAT polypeptides. Such 
polypeptides include isolated polypeptides set forth in the Sequence Listing, as well as 
25 polypeptides and fragments thereof, particularly those polypeptides which exhibit ACAT 
activity and also those polypeptides which have at least 50%, 60% or 70% identity, 
preferably at least 80% identity, more preferably at least 90% identity, and most 
preferably at least 95% identity to a polypeptide sequence selected from the group of 
sequences set forth in the Sequence Listing, and also include portions of such 

3 0 polypeptides, wherein such portion of the polypeptide preferably includes at least 30 

amino acids and more preferably includes at least 50 amino acids. 

"Identity", as is well understood in the art, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by 
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comparing the sequences. In the art, ''identity" also means the degree of sequence 
relatedness between polypeptide or polynucleotide sequences, as determined by the match 
between strings of such sequences. "Identity" can be readily calculated by known 
methods including, but not limited to, those described in Computational Molecular 
5 Biology, Lesk, A.M., ed., Oxford University Press, New York (1988); Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M. and Griffin, H.G., eds., 
Humana Press, New Jersey (1994); Sequence Analysis in Molecular Biology, von Heinje, 
G., Academic Press ( 1987); Sequence Analysis Primer, Gribskov, M. and Devereux, J., 

10 eds., Stockton Press, New York (1991); and Carillo, H., and Lipman, D., SIAM J Applied 
Math, 48: 1073 (1988). Methods to determine identity are designed to give the largest 
match between the sequences tested. Moreover, methods to determine identity are 
codified in publicly available programs. Computer programs which can be used to 
determine identity between two sequences include, but are not limited to, GCG 

15 (Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); suite of five BLAST 
programs, three designed for nucleotide sequences queries (BLASTN, BLASTX, and 
TBLASTX) and two designed for protein sequence queries (BLASTP and TBLASTN) 
(Coulson, Trends in Biotechnology, 12: 76-80 (1994); Birren, et al.. Genome Analysis, 1: 
543-559 (1997)). The BLAST X program is publicly available from NCBI and other 

2 0 sources {BLAST Manual, Altschul, S., et aL, NCBI NLM NIH, Bethesda, MD 20894; 

Altschul, S.. et aL. 7. MoL BioL, 215:403-410 (1990)). The well known Smith Waterman 
algorithm can also be used to determine identity. 

Parameters for polypeptide sequence comparison typically include the following: 

Algorithm: Needleman and Wunsch, J. MoL BioL 48:443-453 (1970) 
25 Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoff, Proc. NatL 

Acad. Sci USA 89:10915-10919 (1992) 

Gap Penalty: 12 

Gap Length Penalty: 4 

A program which can be used with these parameters is publicly available as the 

3 0 "gap" program from Genetics Computer Group, Madison Wisconsin. The above 

parameters along with no penalty for end gap are the default parameters for peptide 
comparisons. 
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Parameters for polynucleotide sequence comparison include the following: 

Algorithm: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970) 

Comparison matrix: matches = +10; mismatches = 0 

Gap Penalty: 50 
5 Gap Length Penalty: 3 

A program which can be used with these parameters is publicly available as the 
"gap" program from Genetics Computer Group, Madison Wisconsin. The above 
parameters are the default parameters for nucleic acid comparisons. 

The invention also includes polypeptides of the formula: 

10 X-(R|)n-(R2)-(R 3 )n-Y 

wherein, at the amino terminus, X is hydrogen, and at the carboxyl terminus, Y is 
hydrogen or a metal, R ( and R3 are any amino acid residue, n is an integer between 1 and 
1000, and R 2 is an amino acid sequence of the invention, particularly an amino acid 
sequence selected from the group set forth in the Sequence Listing and preferably SEQ ID 

15 NOs: 2 and 17. In the formula, R2 is oriented so that its amino terminal residue is at the 

left, bound to Rj, and its carboxy terminal residue is at the right, bound to R 3 . Any stretch 
of amino acid residues denoted by either R group, where R is greater than I, may be either 
a heteropolymer or a homopolymer, preferably aheteropolymer. 

Polypeptides of the present invention include isolated polypeptides encoded by a 

2 0 polynucleotide comprising a sequence selected from the group of a sequence contained in 
the Sequence Listing set forth herein . 

The polypeptides of the present invention can be mature protein or can be part of a 
fusion protein. 

Fragments and variants of the polypeptides are also considered to be a part of the 

2 5 invention. A fragment is a variant polypeptide which has an amino acid sequence that is 

entirely the same as part but not all of the amino acid sequence of the previously 
described polypeptides. The fragments can be "free-standing" or comprised within a 
larger polypeptide of which the fragment forms a part or a region, most preferably as a 
single continuous region. Preferred fragments are biologically active fragments which are 

3 0 those fragments that mediate activities of the polypeptides of the invention, including 

those with similar activity or improved activity or with a decreased activity. Also 
included are those fragments that antigenic or immunogenic in an animal, particularly a 
human. 
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Variants of the polypeptide also include polypeptides that vary from the sequences 
set forth in the Sequence Listing by conservative amino acid substitutions, substitution of 
a residue by another with like characteristics. In general, such substitutions are among 
Ala, Val, Leu and He; between Ser and Thr; between Asp and Glu; between Asn and Gin; 
between Lys and Arg; or between Phe and Tyr. Particularly preferred are variants in 
which 5 to 10; 1 to 5; 1 to 3 or one amino acid(s) are substituted, deleted, or added, in any 
combination. 

Variants that are fragments of the polypeptides of the invention can be used to 
produce the corresponding full length polypeptide by peptide synthesis. Therefore, these 
variants can be used as intermediates for producing the full-length polypeptides of the 
invention. 

The polynucleotides and polypeptides of the invention can be used, for example, 
in the transformation of host cells, such as plant host cells, as further discussed herein. 

The invention also provides polynucleotides that encode a polypeptide that is a 
mature protein plus additional amino or carboxyl-terminal amino acids, or amino acids 
within the mature polypeptide (for example, when the mature form of the protein has 
more than one polypeptide chain). Such sequences can. for example, play a role in the 
processing of a protein from a precursor to a mature form, allow protein transport, shorten 
or lengthen protein half-life, or facilitate manipulation of the protein in assays or 
production. It is contemplated that cellular enzymes can be used to remove any additional 
amino acids from the mature protein. 

A precursor protein, having the mature form of the polypeptide fused to one or 
more prosequences may be an inactive form of the polypeptide. The inactive precursors 
generally are activated when the prosequences are removed. Some or all of the 
prosequences may be removed prior to activation. Such precursor protein are generally 
called proproteins. 

Plant Constructs and Methods of Use 

Of particular interest is the use of the nucleotide sequences in recombinant DNA 
constructs to direct the transcription or transcription and translation (expression) of the 
acyltransferase sequences of the present invention in a host plant cell. The expression 
constructs generally comprise a promoter functional in a host plant cell operably linked to 
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a nucleic acid sequence encoding a acyl-CoA: cholesterol acyltransferase of the present 
invention and a transcriptional termination region functional in a host plant cell. 

Those skilled in the art will recognize that there are a number of promoters which 
are functional in plant cells, and have been described in the literature. Chloroplast and 
5 plastid specific promoters, chloroplast or plastid functional promoters, and chloroplast or 
plastid operable promoters are also envisioned. 

One set of promoters are constitutive promoters such as the CaMV35S or 
FMV35S promoters that yield high levels of expression in most plant organs. Enhanced or 
duplicated versions of the CaMV35S and FMV35S promoters are useful in the practice of 
10 this invention (Odell, et at (1985) Nature 313:810-812; Rogers, U.S. Patent Number 
5,378, 619). In addition, it may also be preferred to bring about expression of the 
acyltransferase gene in specific tissues of the plant, such as leaf, stem, root, tuber, seed, 
fruit, etc., and the promoter chosen should have the desired tissue and developmental 
specificity. 

15 Of particular interest is the expression of the nucleic acid sequences of the present 

invention from transcription initiation regions which are preferentially expressed in a 
plant seed tissue. Examples of such seed preferential transcription initiation sequences 
include those sequences derived from sequences encoding plant storage protein genes or 
from genes involved in fatty acid biosynthesis in oilseeds. Examples of such promoters 

2 0 include the 5* regulatory regions from such genes as napin Kridl et Seed Sci. Res. 
7;209:219 (1991)), phaseolin, zein, soybean trypsin inhibitor, ACP, stearoyl-ACP 
desaturase, soybean a' subunit of {3-conglycinin (soy 7s, (Chen et al., Proc. Natl. Acad. 
ScL, 83:8560-8564 (1986))) and oleosin. * 

It may be advantageous to direct the localization of proteins conferring ACAT to a 

2 5 particular subcellular compartment, for example, to the mitochondrion, endoplasmic 

reticulum, vacuoles, chloroplast or other plastidic compartment. For example, where the 
genes of interest of the present invention will be targeted to plastids, such as chloroplasts, 
for expression, the constructs will also employ the use of sequences to direct the gene to 
the plastid. Such sequences are referred to herein as chloroplast transit peptides (CTP) or 

3 0 plastid transit peptides (PTP). In this manner, where the gene of interest is not directly 

inserted into the plastid, the expression construct will additionally contain a gene 
encoding a transit peptide to direct the gene of interest to the plastid. The chloroplast 
transit peptides may be derived from the gene of interest, or may be derived from a 

13 

BNSDOCID: <WO 9963096A2 I > 



WO 99/63096 



PCT/US99/12541 



heterologous sequence having a CTP. Such transit peptides are known in the art. See, for 
example, Von Heijne et al (1991) Plant Mol Biol Rep. 9: 104-126; Clark et al ( 1989) J. 
Biol Chem. 264: 17544-17550; della-Cioppa «/. (1987) Plant Physiol 54:965-968; 
Romerer al (1993) Biochem. Biophys. Res Commun. 796:1414-1421; and, Shahetal. 
5 ( 1 986) Science 23 J:478-48 1 . 

Depending upon the intended use, the constructs may contain the nucleic acid 
sequence which encodes the entire ACAT protein, or a portion thereof. For example, 
where antisense inhibition of a given ACAT protein is desired, the entire ACAT sequence 
is not required. Furthermore, where ACAT sequences used in constructs are intended for 

10 use as probes, it may be advantageous to prepare constructs containing only a particular 
portion of a ACAT encoding sequence, for example a sequence which is discovered to 
encode a highly conserved ACAT region. 

The skilled artisan will recognize that there are various methods for the inhibition 
of expression of endogenous sequences in a host cell. Such methods include, but are not 

15 limited to, antisense suppression (Smith, et al (1988) Nature 334:724-726) , co- 
suppression (Napoli, et al (1989) Plant Cell 2:279-289), ribozymes (PCT Publication 
WO 97/10328), and combinations of sense and antisense Waterhouse, et al (1998) Proc. 
Natl Acad. Sci. USA 95: 13959-13964. Methods for the suppression of endogenous 
sequences in a host cell typically employ the transcription or transcription and translation 

2 0 of at least a portion of the sequence to be suppressed. Such sequences may be homologous 
to coding as well as non-coding regions of the endogenous sequence. 

Regulatory transcript termination regions may be provided in plant expression 
constructs of this invention as well. Transcript termination regions may be provided by 
the DNA sequence encoding the ACAT or a convenient transcription termination region 

2 5 derived from a different gene source, for example, the transcript termination region which 

is naturally associated with the transcript initiation region. The skilled artisan will 
recognize that any convenient transcript termination region which is capable of 
terminating transcription in a plant cell may be employed in the constructs of the present 
invention. 

3 0 Alternatively, constructs may be prepared to direct the expression of the ACAT 

sequences directly from the host plant cell plastid. Such constructs and methods are 
known in the art and are generally described, for example, in Svab, et al (1990) Proc. 
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Natl. Acad. ScL USA 87:8526-8530 and Svab and Maliga (1993) Proc. Natl. Acad. ScL 
USA 90:913-917 and in U.S. Patent Number 5,693,507. 

A plant cell, tissue, organ, or plant into which the recombinant DN A constructs 
containing the expression constructs have been introduced is considered transformed, 
5 transfected, or transgenic. A transgenic or transformed cell or plant also includes progeny 
of the cell or plant and progeny produced from a breeding program employing such a 
transgenic plant as a parent in a cross and exhibiting an altered phenotype resulting from 
the presence of a ACAT nucleic acid sequence. 

Plant expression or transcription constructs having a plant ACAT as the DNA 
1 0 sequence of interest for increased or decreased expression thereof may be employed with 
a wide variety of plant life, particularly, plant life involved in the production of vegetable 
oils for edible and industrial uses. Most especially preferred are temperate oilseed crops. 
Plants of interest include, but are not limited to, rapeseed (Canola and High Erucic Acid 
varieties), sunflower, safflower, cotton, soybean, peanut, coconut and oil palms, and corn. 
15 Depending on the method for introducing the recombinant constructs into the host cell, 
other DNA sequences may be required. Importantly, this invention is applicable to 
dicotyledyons and monocotyledons species alike and will be readily applicable to new 
and/or improved transformation and regulation techniques. 

Of particular interest, is the use of plant ACAT constructs in plants to produce 

2 0 plants or plant parts, including, but not limited to leaves, stems, roots, reproductive, and 

seed, with a modified content of lipid and/or sterol esters. 

For immunological screening, antibodies to the protein can be prepared by 
injecting rabbits or mice with the purified protein or portion thereof, such methods of 
preparing antibodies being well known to those in the art. Either monoclonal or 
25 polyclonal antibodies can be produced, although typically polyclonal antibodies are more 
useful for gene isolation. Western analysis may be conducted to determine that a related 
protein is present in a crude extract of the desired plant species, as determined by cross- 
reaction with the antibodies to the encoded proteins. When cross-reactivity is observed, 
genes encoding the related proteins are isolated by screening expression libraries 

3 0 representing the desired plant species. Expression libraries can be constructed in a variety 

of commercially available vectors, including lambda gtl 1, as described in Sambrook, et 
al. (Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York). 

15 
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To confirm the activity and specificity of the proteins encoded by the identified 
nucleic acid sequences as acyltransferase enzymes, in vitro assays are performed in insect 
cell cultures using baculovirus expression systems. Such baculovirus expression systems 
are known in the art and are described by Lee, et al U.S. Patent Number 5,348,886, the 
5 -entirety of which is herein incorporated by reference. 

In addition, other expression constructs may be prepared to assay for protein 
activity utilizing different expression systems. Such expression constructs are 
transformed into yeast or prokaryotic host and assayed for acyltransferase activity. Such 
expression systems are known in the art and are readily available through commercial 
10 sources. 

Also of interest in the present invention is the use of such acyl CoAxholesterol 
acyltransferase encoding nucleic acid sequences for the preparation of expression 
constructs to increase the production of triacylglycerol in plant. Such ACAT nucleic acid 
sequences may also encode sequences of amino acids, such as, a protein, polypeptide, or 

15 peptide, which demonstrates the ability to acylate the sn-3 position of sn- 1,2- 

diacylglycerol under plant enzyme conditions. Such DAGAT sequences will find use in 
a variety of applications related to production of TAG in host cells. As DAGAT directed 
lipid synthesis towards TAG production and away from membrane lipid production, a 
number of uses for increasing TAG production in cells normally capable of producing 

2 0 TAG or providing TAG production in cells not normally capable of making TAG are 
considered. 

For example, expression of a laurate-specific DAGAT may be used to provide for 
increased production of laurate ( 1 2:0) fatty acids in TAG in plant cells transformed to 
express a thioesterase specific for laurate production. Such plants are described, for 

2 5 example, in USPN 5298,42 1 . Plant sources for such laurate preferring DAGATs may 

include California bay, Cuphea species, and coconut. 

Similarly, for increased production of stearate (18:0) fatty acids in plant seed oils, 
a DAGAT having preferential activity on stearate -containing DAG may be found, for 
example, in various tropical plant species such as Garcinia species, including mangosteen 

3 0 and kokum, plants of the Mangifera family, such as mango, and various other tropical 

plants including Butyrospermum (shea), Pentadesma (tallow tree), Illipe (illipe butter), 
Theobroma (cocoa). Simarouha (tree of paradise) and Shorea (sal). 
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In addition to the sequences described in the present invention, DNA coding 
sequences useful in the present invention can be derived from algae, fungi, bacteria, 
mammalian sources, plants, etc. Homology searches in existing databases using 
signature sequences corresponding to conserved nucleotide and amino acid sequences of 
5 ACAT can be employed to isolate equivalent, related genes from other sources such as 
plants and microorganisms. Searches in EST databases can also be employed. 
Furthermore, the use of DNA sequences encoding enzymes functionally enzymatically 
equivalent to those disclosed herein, wherein such DNA sequences are degenerate 
equivalents of the nucleic acid sequences disclosed herein in accordance with the 

10 degeneracy of the genetic code, is also encompassed by the present invention. 

Demonstration of the functionality of coding sequences identified by any of these 
methods can be carried out by complementation of mutants of appropriate organisms, 
such as Synechocystis, Shewanella, yeast, Pseudomonas, Rhodobacteria, etc., that lack 
specific biochemical reactions, or that have been mutated. The sequences of the DNA 

15 coding regions can be optimized by gene resynthesis, based on codon usage, for 
maximum expression in particular hosts. 

For the alteration of sterol and sterol ester production in a host cell, a second 
expression construct can be used in accordance with the present invention. For example, 
the ACAT expression construct can be introduced into a host cell in conjunction with a 

20 second expression construct having a nucleotide sequence for a protein involved in sterol 
biosynthesis. 

In order to increase triglyceride (Also referred to herein as TAG) biosynthesis, and 
thereby increasing fatty acids, in a plant tissue, coexpression of a plant or other ACAT in 
a plant tissue with a second gene involved in fatty acid biosynthesis may also find use in 

25 the present invention. For example, coexpression of a ACAT sequence in plant seed 

tissue with a DNA sequence encoding for another protein involved in TAG biosynthesis, 
such as LPAAT (U.S. Patent Application 07/458,109, the entirety of which is 
incorporated herein by reference) may increase the flux through thekennedy pathway and 
increase the total fatty acids produced in the seed tissue. 

3 0 Furthermore, for increased production of a particular chain length fatty acid, for 

example medium-chain fatty acids, coexpression of a plant or other ACAT in a plant 
tissue with a second DNA sequence encoding for enzymes involved in the production of 
medium-chain, or other chain length, fatty acids may find use in the present invention. 
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DNA sequences encoding for thioesterases (for example USPN 5,298,421, USPN 
5,667,997 the entirety of which are incorporated herein by reference) or fatty acid 
synthases (U.S. Patent Application Serial Number 08/827,828 the entirety of which is 
incorporated herein by reference) are examples of enzymes involved in the production of 
5 various chain length fatty acids. 

The method of transformation in obtaining such transgenic plants is not critical to 
the instant invention, and various methods of plant transformation are currently available. 
Furthermore, as newer methods become available to transform crops, they may also be 
directly applied hereunder. For example, many plant species naturally susceptible to 

10 Agrobacterium infection may be successfully transformed via tripartite or binary vector 
methods of Agrobacterium mediated transformation. In many instances, it will be 
desirable to have the construct bordered on one or both sides by T-DNA, particularly 
having the left and right borders, more particularly the right border. This is particularly 
useful when the construct uses A. tumefaciens or A. rhizogenes as a mode for 

15 transformation, although the T-DNA borders may find use with other modes of 

transformation. In addition, techniques of microinjection, DNA particle bombardment, 
and electroporation have been developed which allow for the transformation of various 
monocot and dicot plant species. 

Normally, included with the DNA construct will be a structural gene having the 

2 0 necessary regulatory regions for expression in a host and providing for selection of 

transformant cells. The gene may provide for resistance to a cytotoxic agent e.g. 
antibiotic, heavy metal, toxin, etc., complementation providing prototrophy to an 
auxotrophic host, viral immunity or the like. Depending upon the number of different 
host species the expression construct or components thereof are introduced, one or more 
25 markers may be employed, where different conditions for selection are used for the 
different hosts. 

Where Agrobacterium is used for plant cell transformation, a vector may be used 
which may be introduced into the Agrobacterium host for homologous recombination 
with T-DNA or the Ti- or Ri-plasmid present in the Agrobacterium host. The Ti- or Ri- 

3 0 plasmid containing the T-DNA for recombination may be armed (capable of causing gall 

formation) or disarmed (incapable of causing gall formation), the latter being permissible, 
so long as the vir genes are present in the transformed Agrobacterium host. The armed 
plasmid can give a mixture of normal plant cells and gall. 
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In some instances where Agrobacterium is used as the vehicle for transforming 
host plant cells, the expression or transcription construct bordered by the T-DNA border 
region(s) will be inserted into a broad host range vector capable of replication in E. coll 
and Agrobacterium, there being broad host range vectors described in the literature. 
5 Commonly used is pRK2 or derivatives thereof. See, for example, Ditta, et aL, (Proc. 

Nat. Acad. ScL, U.S.A. (1980) 77:7347-7351) and EPA 0 120 515, which are incorporated 
herein by reference. Alternatively, one may insert the sequences to be expressed in plant 
cells into a vector containing separate replication sequences, one of which stabilizes the 
vector in E. coli, and the other in Agrobacterium. See, for example, McBride and 

10 Summerfelt {Plant Mol. Biol. (1990) 74:269-276), wherein the pRiHRI (Jouanin, et aL, 
Mol. Gen. Genet. (1985) 201:370-374) origin of replication is utilized and provides for 
added stability of the plant expression vectors in host Agrobacterium cells. 

Included with the expression construct and the T-DNA will be one or more 
markers, which allow for selection of transformed Agrobacterium and transformed plant 

15 cells. A 

number of markers have been developed for use with plant cells, such as resistance to 
chloramphenicol, kanamycin, the aminoglycoside G418, hygromycin, or the like. The 
particular marker employed is not essential to this invention, one or another marker being 
preferred depending on the particular host and the manner of construction. 
2 0 For transformation of plant cells using Agrobacterium, explants may be combined 

and incubated with the transformed Agrobacterium for sufficient time for transformation, 
the bacteria killed, and the plant cells cultured in an appropriate selective medium. Once 
callus forms, shoot formation can be encouraged by employing the appropriate plant 
hormones in accordance with known methods and the shoots transferred to rooting 

2 5 medium for regeneration of plants. The plants may then be grown to seed and the seed 

used to establish repetitive generations and for isolation of vegetable oils. 

There are several possible ways to obtain the plant cells of this invention which 
contain multiple expression constructs. Any means for producing a plant comprising a 
construct having a DNA sequence encoding the expression construct of the present 

3 0 invention, and at least one other construct having another DNA sequence encoding an 

enzyme are encompassed by the present invention. For example, the expression construct 
can be used to transform a plant at the same time as the second construct either by 
inclusion of both expression constructs in a single transformation vector or by using 
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separate vectors, each of which express desired genes. The second construct can be 
introduced into a plant which has already been transformed with the ACAT expression 
construct, or alternatively, transformed plants, one expressing the ACAT construct and 
one expressing the second construct, can be crossed to bring the constructs together in the 
5 same plant. 

The invention also relates to vectors that include a polynucleotide or 
polynucleotides of the invention, host cells that are genetically engineered with vectors of 
the invention and the production of polypeptides of the invention by recombinant 
techniques. Cell free translation systems can be employed to produce such protein using 

10 RNAs derived from the DNA constructs of the invention. 

For recombinant production, host cells can be genetically engineered to 
incorporate expression systems or portions thereof or polynucleotides of the present 
invention. Introduction of a polynucleotide into a host cell can be effected by methods 
described in many standard laboratory manuals, such as Davis et al., Basic Methods in 

15 Molecular Biology, ( 1986) and Sambrook et al, Molecular Cloning: A Laboratory 

Manual, 2 nd Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor NY 
(1989). Such methods include, but are not limited to. calcium phosphate transfection, 
DEAE dextran mediated transfection, transvection. microinjection, cationic lipid- 
mediated transfection, electroporation, transduction, scrape loading ballistic introduction 

20 and infection. 

Representative examples of appropriate hosts include bacterial cells, such as 
streptococci, staphylococci, enterococcL E. coli, streptomyces, and Bacillus subtilis cells; 
fungal cells, such as yeast cells and Aspergillus cells; insect cells, such as Drosophila S2 
and Spodoptera Sf9 cells; animal cells such as CHO, COS, HeLa, CI 27, 3T3, BHK, 293 

25 and Bowes melanoma cells; and plant cells as described above. 

A variety of expression systems can be used to produce the polypeptides of the 
invention. Such vectors include, but are not limited to, chromosomal, episomal, and virus 
derived vectors, for example vectors from bacterial plasmids, bacteriophage, transposons, 
yeast episomes, insertion elements, yeast chromosomal elements, viruses such as 

3 0 baculoviruses, papova viruses, such as SB40, vaccinia viruses, adenoviruses, fowl pox 

viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations of 
such viruses, such as those derived from plasmid and bacteriophage genetic elements, 
such as cosmids and phagemids. The expression system constructs may contain control 
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regions that regulate as well as engender expression. Generally, any system or vector 
which is suitable to maintain, propagate or express polynucleotides and/or to express a 
polypeptide in a host can be used for expression. The appropriate DN A sequence can be 
inserted into the chosen expression by any of a variety of well-known and routine 
5 techniques, such as, for example, those set forth in Sambrook et al, Molecular Cloning, A 
Laboratory Manual^ {supra). 

Appropriate secretion signals, either homologous or heterologous, can be 
incorporated into the expressed polypeptide to allow the secretion of the protein into the 
lumen of the endoplasmic reticulum, the periplasmic space or the extracellular 

10 environment. 

The polypeptides of the present invention can be recovered and purified from 
recombinant cell cultures by any of a number of well known methods, including, but not 
limited to, ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatography, hydrophobic interaction 

1 5 chromatography, affinity chromatography, hydroxylapatite chromatography, and lectin 
chromatography. It is most preferable to use high performance liquid chromatography 
(HPLC) for purification. Any of the well known techniques for protein refolding can be 
used to regenerate an active confirmation if the polypeptide is denatured during isolation 
and/or purification. 

2 0 This invention is also related to the use of the polynucleotides of the invention as 

diagnostic reagents. Detection of a mutated form of a gene can be used as a diagnostic 
tool that to assist in the diagnosis of a disease or of susceptibility to a disease which 
results from under-expression, over-expression or altered expression of the gene. A 
variety of well known techniques can be used to detect, at the DNA level, an individual 

2 5 who has a mutation in the gene. 

Nucleic acids for diagnosis may be obtained from an infected individual's cells 
and tissues, such as bone, blood, muscle, cartilage and skin. Genomic DNA can be used 
directly for detection or can be amplified prior to analysis using PCR or other 
amplification techniques. RNA or cDNA can also be used in the same manner. Deletions 

3 0 and insertions can be detected by a change in the size of the amplified product as 

compared to the genotype of a reference sequence. Point mutations can be identified by 
hybridizing amplified DNA to labeled polynucleotide sequences of the invention. 
Sequences that are perfectly matched can be distinguished from mismatched duplexes by 
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RNase digestion or by differences in the melting temperature. Sequence differences can 
also be detected, at the DNA level, by comparing electrophoretic mobility of DNA 
fragments in gels, with or without denaturing agents; or by direct DNA sequencing (See, 
for example, Myers et al.. Science 230: 1242 (1985)). A sequence change at a particular 
5 location can also be detected using nuclease protection assays, such as RNase and SI 
protection or a chemical cleavage method (See, for example, Cotton et aL, Proc\ Natl 
Acad, 5c/., USA, 85: 4397-4401 (1985). It is anticipated that an array of oligonucleotide 
probes comprising a DAGAT nucleotide sequence or fragments thereof can be used for 
screening, particularly for genetic mutations. Array technology methods are well known 

10 and are useful in gene expression, genetic linkage and genetic variability analyses (See, 
for example, M. Chee et aL Science, 274: 610-613 (1996)). 

The invention further provides a method for diagnosing or determining a 
susceptibility to a disease associated with DAGAT activity, particularly diseases 
associated with altered cellular diacylglycerol concentration or protein kinase C activity, 

15 including, but not limited to cancer; diabetes; cardiopulmonary diseases including, but not 
limited to heart failure, and atherosclerosis; adipocytosis; leukemia and skin carcinoma; 
fibroblastoma; metabolic disorders; obesity; diseases associated with abnormal lipid 
metabolism; diseases associated with abnormal fat absorption, lipoprotein secretion and 
adipogenesis, by determining from a sample an abnormally altered level of polypeptide or 

20 mRNA. Altered expression can be measured at the RNA level by any of the techniques 
well known in the art for quantitation of polynucleotides, including, but not limited to, 
amplification, PCR, RT-PCR, RNase protection, Northern blotting and other 
hybridization methods. Diagnostic assays are also contemplated which detect levels of 
protein expression including, but not limited to radioimmunoassays, competitive-binding 

2 5 assays, Western Blot analysis and ELISA assays. 

The nucleotide sequences of the present invention can also be used in 
chromosome identification. 

The polypeptides of the invention or variants thereof, or cells expressing them can 
be used as an immunogen to produce antibodies which are immunospecific for 

3 0 polypeptides of the present invention. "Immunospecific" means that the antibodies have a 

substantially greater affinity for the polypeptides of the present invention as compared to 
the affinity of the antibodies for other related polypeptides. iK Antibodies^ includes 
monoclonal and polyclonal antibodies, including chimeric, single chain, simianized, 
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humanized, resurfaced and other types of complementarity determining region (CDR) 
replaced antibodies, as well as Fab fragments ; including products of an Fab 
immunoglobulin expression library. 

Antibodies can be obtained by administering the polypeptides or epitope bearing 
5 fragments, analogs or cells to an animal, preferably non-human, using routine protocols. 
Any of the well known techniques continuous cell culturing techniques can be used to 
prepare monoclonal antibodies including hybridoma technology (See for example, Kohler, 
G. and Milstein, C, Nature 256:495-497 (1975)); trioma technology; human B-cell 
hybridoma technology (Kozbor et al., Immunology Today 4:72 (1983)); and the EBV- 
10 hybridoma technology (Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, 77-96,(1985)). 

Single chain, humanized, resurfaced, simianized and other types of CDR replaced 
antibodies can be produced according to techniques which are well known in the art. 

The described antibodies can be used to isolate or identify clones that express the 

15 polypeptide or to purify polypeptides by affinity chromatography. The antibodies can also 
be used to treat diseases associated with DAGAT activity, particularly diseases associated 
with altered cellular diacylglycerol concentration or protein kinase C activity, including, 
but not limited to cancer; diabetes; cardiopulmonary diseases including, but not limited to 
heart failure, and atherosclerosis; adipocytosis; leukemia and skin carcinoma; 

2 0 fibroblastoma; metabolic disorders; obesity; diseases associated with abnormal lipid 

metabolism; diseases associated with abnormal fat absorption, lipoprotein secretion and 
adipogenesis. 

The present invention also relates to genetically engineered soluble fusion proteins 
which comprises a polypeptide of the present invention, or a fragment thereof, fused to 

2 5 portions of the constant regions of the heavy or light chains of immunoglobulins of the 

various subclasses (IgG, IgM, IgA and IgE). Preferably the constant portion of the heavy 
chain of human IgG, particularly IgGl, is used with fusion at the hinge region. 
Particularly preferred is the use of Fc portion. (See, for example, WO 94/29458 and WO 
94/22914) 

3 o Polypeptides of the present invention can also be used to identify compounds 

which bind to the polypeptide, and in particular, inhibit or stimulate the activity of the 
polypeptide by binding. The binding of small molecule substrates and ligands can be 
assessed in, for example, cells, cell-free preparations, chemical libraries, and natural 
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product mixtures. The agonists or antagonists/inhibitors can be natural substrates or 
ligands or can be structural or functional mimetics thereof. See, for example, Coligan et 
al., Curr Prot in Immuno, l(2):Chapter 5 (1991). 

The invention also provides a method for screening compounds to identify those 
5 compounds that bind to the polypeptides or polynucleotides of the present invention and 
particularly those compounds that enhance (agonist) or inhibit (antagonist) the action of 
polypeptides or polynucleotides of the invention. High throughput screening techniques 
can be used. As an example, to screen for agonists or antagonists, a synthetic reaction 
mix, a cellular compartment, such as a membrane, cell envelope or cell wall, or a 

10 preparation of any of these, comprising a polypeptide of the invention and a labeled 
substrate or ligand of such polypeptide is incubated in the absence or presence of a 
candidate compound that is being screening. The ability of the candidate compound to 
agonize or antagonize a polypeptide of the invention is detected by a decrease in binding 
of the labeled ligand or a decrease in the production of product from the substrate. 

15 Candidate compounds that bind gratuitously, without inducing the effects of a polypeptide 
of the invention, are most likely to be good antagonists. On the other hand, compounds 
that bind well and increase the rate of product production from substrate are considered 
agonists. The detection of the rate or level of production of product from substrate can be 
enhanced by using a reporter system such as, but not limited to, colorimetric labeling, 

2 0 inclusion of a reporter gene that is responsive to changes in polynucleotide or polypeptide 
activity and binding assays known in the art. 

Competitive assays that combine a polypeptide of the invention and a potential 
antagonist with a compound that binds the polypeptide, natural substrates or ligands. or 
substrate or ligand mimetics can also be used to screen for antagonist compounds. The 

2 5 polypeptide of the invention can be label, such as by radioactivity or colorimetric 

compound, such that the number of such polypeptide molecules that bound to the binding 
molecule or converted to product can be determined to assess the effectiveness of the 
potential antagonist. 

Potential antagonists can include, but are not limited to, small organic molecules, 

3 0 peptides, polypeptides and antibodies that bind to a polynucleotide or polypeptide of the 

invention and thereby inhibit or partially or completely block its activity. Antagonists can 
also include small organic molecules, peptides, polypeptides and antibodies that bind to 
the same site on a binding molecule without inducing the activities that are induced by a 
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polypeptide of the invention, thereby preventing the action of the polypeptide by blocking 
it from binding. Potential antagonists also include small molecules that bind to and 
occupy the binding site of the polypeptide thereby preventing the polypeptide from 
binding to cellular binding molecules, so as to prevent or reduce normal biological 
5 activity of the polypeptide. Examples of such small molecules include, but are not limited 
to, small organic molecules, peptides and peptide like molecules. Other potential 
antagonists include antisense molecules (see, for example, Okano, 7. Neurochem, 56: 560 
(1991); Oligodeoxy nucleotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL(1988)). 

10 Antagonists and agonists of DAGAT activity are particular useful as DAGAT is 

important in the formation of chylomicra in small intestine. VLDL in liver, and for 
storage of energy as triacylglycerol in adipose tissue. Thus, inhibiting DAGAT activity in 
small intestine, liver, and adipose tissues will reduce lipid absorption and plasma 
triglyceride levels and will decrease adipogenesis. Further, hypertriglyceridemia has been 

15 shown to be an independent risk factor for atherosclerosis (Kugiyama, K., et al., (1998) 
Circulation 97:2519-2526,) and is a marker for increased risk of coronary artery disease 
and can serve as a marker for several atherogenic factors. (Grundy, S.M., (1998) Am. 7. 
Cardiol, 81:18B-25B). Compounds that inhibit DAGAT activity are also useful in 
controlling intestinal fat absorption, altering TAG rich lipoprotein secretion and 

2 0 controlling serum TAG, and reducing adipogenesis (Owen MR, et al. (1997) Biochem 7 

323:17-21, Jamdar SC and Cao WF (1995) Biochim Biophys Acta 1255:237-243). 
Furthermore, the diacylglycerol substrate of DAGAT is a signal transducing molecule 
within the cell and is a known modulator of protein kinase C activity. Altered cellular 
diacylglycerol concentration and PROTEIN KINASE C activity has been associated with 
25 cancer (da Costa et al.,(1993) 7. BioL Chem. 268:2100-2105), diabetes (Koya D and King 
GL (1998) Diabetes 47:859-866), heart failure (Okumura, et al., (1991) 7. MoL Cell. 
Cardiol 23:409-416), adipocyte (Baldo et aL, (1995) 7. Lipid Res., 36: 1415-1426), 
leukemia and skin carcinoma cells (Goldkorn T., and Ding, T. (1997) Adv. Exp, Med. 
Biol, 400A:46 1-472), and rat fibroblasts (Pai et al., (1991) Proc. Natl Acad. Sci., 

3 0 88:598-602). As such, agonists and antagonists of the invention are particularly useful in 

treating or ameliorating diseases associated with DAGAT activity, including diseases 
associated with altered cellular diacylglycerol concentration or protein kinase C activity, 
including, but not limited to cancer: diabetes: cardiopulmonary diseases including, but not 
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limited to heart failure, and atherosclerosis; adipocytosis; leukemia and skin carcinoma; 
fibroblastoma; metabolic disorders; obesity; diseases associated with abnormal lipid 
metabolism; diseases associated with abnormal fat absorption, lipoprotein secretion and 
adipogenesis. 

The invention also relates to compositions comprising the polynucleotide or the 
polypeptide, or variants, agonists or antagonists thereof. The polypeptides of the 
invention can be used in combination with a sterile or non-sterile carrier or carriers for use 
with cells, tissues or organisms, such as a pharmaceutical carrier suitable for 
administration to a subject. Such compositions comprise, for example, a therapeutically 
effective amount of a polypeptide or other compound of the invention and a 
pharmaceutical^ acceptable carrier orexcipient. Such carriers include, but are not 
limited to, saline, buffered saline, dextrose, water, glycerol, ethanol and combinations 
thereof. The formulation should be consistent with the mode of administration. The 
invention further relates to diagnostic and pharmaceutical packs or kits comprising one or 
more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. 

Polypeptides and other compounds of the invention may be administered alone or 
in combination with other compounds. 

The pharmaceutical compositions can be administered in any effective, convenient 
manner including, but not limited to, topical, oral, anal, vaginal, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes. 

The required dosage range will depend on the peptide or other compound of the 
present invention that is used, the route of administration, the nature of the formulation, 
the nature of the subject's condition and the judgment of the practitioner. Suitable 
dosages will generally be in the range of approximately 0.1 to lOOjig/kg. The large 
variation in the dosage is expected due to the variety of compounds and the differences in 
the efficacy of administration. As an example, it is expected that oral administration 
would require higher dosages than intravenous administration. The skilled practitioner 
can determine the appropriate dosage using standard empirical methods. 

Polypeptides can also be generated endogenously in the subject, which is generally 
referred to as "gene therapy" For example, cells from a subject may be engineered with a 
polynucleotide, such as a DNA or RNA, to encode a polypeptide, ex vivo, and by the use 
of a retroviral plasmid vector. The cells are then introduced into the subject. 
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The polynucleotide and polypeptide sequences can also be used to identify 
additional sequences which are homologous to the sequences of the present invention. 
The most preferable and convenient method is to store the sequence in a computer 
readable medium, for example, floppy disk, CD ROM, hard disk drives, external disk 
5 drives and DVD, and then to use the stored sequence to search a sequence database with 
well known searching tools. Examples of public databases include the DNA Database of 
Japan (DDBJ)(http://www.ddbj.nig.ac.jp/); Genebank 

(http://www. ncbi.nlm.nih.gov/web/Genbank/Index.htlm); and the European Molecular 
Biology Laboratory Nucleic Acid Sequence Database (EMBL) 

10 ( http://www.ebi.ac.uk/ebi docs/embl db.html) . A number of different search algorithms 
are available to the skilled artisan, one example of which are the suite of programs 
referred to as BLAST programs. There are five implementations of BLAST, three 
designed for nucleotide sequences queries (BLASTN, BLASTX, and TBLASTX) and two 
designed for protein sequence queries (BLASTP and TBLASTN) (Coulson, Trends in 

15 Biotechnology, 12: 76-80 (1994); Birren, et al t Genome Analysis, 1: 543-559 (1997)). 

Additional programs are available in the art for the analysis of identified sequences, such 
as sequence alignment programs, programs for the identification of more distantly related 
sequences, and the like, and are well known to the skilled artisan. 

The invention now being generally described, it will be more readily understood 

20 by reference to the following examples which are included for purposes of illustration 
only and are not intended to limit the present invention. 

EXAMPLES 

Example 1 RNA Isolations 

2 5 Total RNA from the inflorescence and developing seeds of Arabidopsis thaliana 

is isolated for use in construction of complementary (cDNA) libraries. The procedure is 
an adaptation of the DNA isolation protocol of Webb and Knapp (D.M. Webb and SJ. 
Knapp, (1990) Plant Molec. Reporter, 8, 180-185). The following description assumes 
the use of lg fresh weight of tissue. Frozen seed tissue is powdered by grinding under 

3 0 liquid nitrogen. The powder is added to 10ml REC buffer (50mM Tris-HCl, pH 9, 0.8M 

NaCl, lOmM EDTA, 0.5% w/v CTAB (cetyltrimethyi-ammonium bromide)) along with 
0.2g insoluble polyvinylpolypyrrolidone, and ground at room temperature. The 
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homogenate is centrifuged for 5 minutes at 12,000 xg to pellet insoluble material. The 
resulting supernatant fraction is extracted with chloroform, and the top phase is recovered. 

The RNA is then precipitated by addition of 1 volume RecP (50mM Tris-HCL 
pH9, lOmM EDTA and 0.5% (w/v) CTAB) and collected by brief centrifugation as 
5 before. The RNA pellet is redissolved in 0.4 ml of 1M NaCl. The RNA pellet is 

redissolved in water and extracted with phenol/chloroform. Sufficient 3M potassium 
acetate (pH 5) is added to make the mixture 0.3M in acetate, followed by addition of two 
volumes of ethanol to precipitate the RNA. After washing with ethanol, this final RNA 
precipitate is dissolved in water and stored frozen. 

10 

Example 2 Identification of ACAT Sequences 

Since plant ACATs are unknown in the art, searches are performed to identify 
known and related ACAT sequences from mammalian sources from public databases. 
These sequences are then used to search public and proprietary EST databases to identify 

15 plant ACAT-like sequences. 

A public database containing mouse Expressed Sequence Tag (EST) sequences 
(dBEST) is searched for ACAT-like sequences. The search identified two sequences 
(SEQ ID NOs: 12 and 13) which are related (approximately 20% identical), but divergent, 
to known ACAT sequences. 

2 0 In order to identify ACAT-like sequences from other organisms, the two mouse 

ACAT sequences are used to search public and proprietary databases containing EST 
sequences from human and rat tissues. Results of the search identified approximately 180 
sequences from the human database, which were assembled into a complete inferred 
cDNA sequence (Figure 7) (SEQ ID NO: 14) and approximately 35 from the rat database, 

2 5 also assembled into an inferred cDN A sequence using the GCG assembly program 

(Figure 8) (SEQ ID NO: 15) which are closely related to the mouse sequences. 

The protein sequence of the human ACAT-like sequence was aligned with known 
ACAT sequences from human (Chang, et al (1993) J. Biol. Chem. 268:20747-20755, 
(SEQ ID NO:22)), mouse (Uelmen, et al (1995) J. Biol Chem. 270:26192-26201, (SEQ 

3 0 ID NO:23)) and yeast (Yu, et al (\996)J.Biol Chem. 271:24157-24163, (SEQ ID 

NO:24) and Yang, et al (1996) Science 272:1353-1356, (SEQ ID NO:25)) using 
Mac Vector (Oxford Molecular, Inc. ). Results of the alignment (Figure 9) suggests that 
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the sequence is related to the known sequences, however the related sequence is only 
about 25% similar to the known sequences. 

The protein sequence of the human sterol O-acyltransferase (ACAT, Acyi 
CoA:Cholesterol acyltransferase. Accession number A48026) ) (SEQ ID NO: 15) related 
5 sequence was used to search protein and nucleic acid Genbank databases. A single plant 
homologue (Figure 1) was identified in the public Arabidopsis EST database (Accession 
A042298, SEQ ID NO: 1 ). The protein sequence was translated from the EST sequence, 
and was found to contain a peptide sequence conserved in both mammalian and yeast 
ACATs (Chang et aL, (1997) Amu Rev. Biochem., 66:613-638) ) (SEQ ID NO:21). 

10 To obtain the entire coding region corresponding to the Arabidopsis ACAT-like 

EST, synthetic oligo-nucleotide primers are designed to amplify the 5' and 3' ends of 
partial cDNA clones containing ACAT-like sequences. Primers are designed according to 
the Arabidopsis ACAT-like EST sequence and are used in Rapid Amplification of cDNA 
Ends (RACE) reactions (Frohman et al (1988) Proc. Natl. Acad, ScL USA 85:8998- 

15 9002). 

Primers are designed 5 * -TGC AAATTG ACG AGC AC ACC A ACCCCTTC-3 ' 
(SEQ ID NO: 26) and 5 , -AAGGATGCTTTGAGTTCCTGACAATAGG-3 , (SEQ ID 
NO:27) to amplify the 5' end from the Arabidopsis ACAT EST sequence. Amplification 
of flanking sequences from cDNA clones are performed using the Marathon cDNA 
2 0 Amplification kit (Clontech, CA). 

Sequence derived from the 5' -RACE amplification is used to search proprietary 
Arabidopsis EST libraries. A single EST accession, LIB25-088-C7 (SEQ ID NO: 1), is 
identified which contains sequence identical to the 5' -RACE sequence. Furthermore, 
LIB25-088-C7 is found to contain the complete putative coding sequence for the 

2 5 Arabidopsis ACAT-like product ) (SEQ ID NO: 1). 

The nucleic acid as well as the putative translation product sequences of A042298 
were used to search public and proprietary databases. Four EST sequences are identified 
in both soybean (Figure 2) (SEQ ID NO:3-6) and maize (Figure 3) (SEQ ID NO:7-10) 
proprietary databases, and a single ACAT-like sequence is identified from Morrierrella 

3 0 alpina EST sequences (Figure 4) (SEQ ID NO: 1 1). 

The rat ACAT-like DN A sequence derived from the assembly of single pass 5' 
terminal sequences of a number of cDNA sequences (SEQ ID NO: 15) does not contain a 
single open reading frame, due to errors in the sequence obtained. Thus, the DNA 
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sequence coding for the full length Rat ACAT-like sequence is obtained using RACE 
reactions for use in expression construct preparation. 

A rat cDNA clone (# 700938833) ) (SEQ ID NO: 15), which is homologous to the 
rat ACAT-like sequence described above (Figure 8) is obtained, and its DNA sequence is 
determined. The largest open reading frame extended to the 5' end of the clone, 
suggesting that the cDNA is not long enough to encode the entire protein. 

A cDNA clone representing the 5'end of the ACAT-like gene is isolated using 
RAT Marathon-Ready cDNA derived from rat adipocyte tissue (Clontech #7481-1) 
according to the manufacturer's protocol. The primary PCR reaction is carried out using 
the gene specific primer 5 ' -T AGGTG AC AG ACTC AGC ATTCC ACC AGTCCC-3 ' (SEQ 
ID NO:28), and a nested PCR reaction is carried out using the gene specific primer 5'- 
CGCCAGCTTTAAGAGACGCTCAAT GATTCG-3' (SEQ ID NO:29). The nested 
PCR yields a prominent product approximately 900 nucleotides in length. The PCR 
product is cloned into plasmid pCR2.1 according to the manufacturer's protocol 
(Invitrogen). The sequence of several clones was determined. Although the open reading 
frame extends to the 5' end of the cDNA, the prominent PCR product of a discrete size in 
the 5'RACE reaction suggests that the 5' end of the clone represents the 5* end of the 
mRNA. It remains a possibility that the protein is larger than the protein inferred from the 
DNA sequence of the presently described cDNA. The sequence of the 5'RACE products 
and the clone were assembled to yield the sequence described in this application (Figure 

10) (SEQ ID NO: 16). 

The primers 5'-GGATCCCTGCAGGTCAGGCCCCCACTGGGGCATCATA-3* 

(SEQ ID NO:30) and 5'- 

GGATCCGCGGCCGCACAATGGGCGACCGCGGAGGCGCGGGA-3' (SEQ ID 
NO:31) are used to PCR amplify the open reading frame (ORF) from rat adipocyte 
Marathon Ready cDNA (clontech). These primers introduce NotI and Sse8387I 
restriction sites at the 5* and 3' ends of the ORF, respectively. The PCR product was 
cloned into plasmid pCR2. 1 according to the manufacturer's protocol (Invitrogen) to yield 
plasmid pCGN8592. The complete nucleotide sequence and deduced amino acid sequence 
for the rat ACAT-like gene are shown in Figures 10 (SEQ ID NO: 16) and 1 1 (SEQ ID 
NO: 17), respectively. For expression of the Rat ACAT-like protein in insect cells using a 
baculovirus expression system, the NotI-Sse8387I fragment of pCGN8592 was cloned 
into Notl-PstI digested plasmid pFASTBACl (Gibco), and the resultant plasmid 
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pCGN9704, was transformed into E. coli DH10BAC (Gibco) to generate bacmid9704. 
The bacmid DNA was used to transfect insect cells. 

The rat ACAT-like protein sequence with used to query the Worm Pep database 
(http://www.sanger.ac.uk/Projects/ C_elegans/blast_server.shtml) using BlastP. One 
5 sequence, H19N07.4 (SEQ ID NO: 18), showed significant homology to the rat sequence. 
The nematode sequence was aligned to the rat ACAT-like sequence using the Clustal W 
alignment tool of Macvector (Oxford Molecular). After alignment, 45% of the amino 
acids are identical and 62% are similar or identical. The nematode protein is likely to 
harbor DAGAT activity and could be used to produce triglycerides in transgenic cells. C. 

10 elegans clone yk453a2 (SEQ ID NO: 19 and 20) appears to be a full length cDNA clone 
encoding the C. elegans protein (SEQ ID NO: 18). Both the 5' and 3' end sequences of 
this clone are present in Genbank. This sequence can be used to design PCR primer to 
amplify the ORF with suitable restriction sites for expression of the C. elegans protein in 
heterologous systems such as insect cells, plant cells, E. coli and other microbes. For 

15 example, the primers 5'- 

GGATCCGCGGCCGCACAATGCGTCAACAAACGGGACGACGG (SEQ ID NO:32) 
and 5'-GGATCCCCTGCAGGTCAAATACCAACGGTTTGGTTTTG (SEQ ID NO:33) 
could be used to amplify the cDNA encoding the C. elegans protein. These primers 
introduce NotI and Sse8387I sites suitable for cloning the ORF for expression in plant, 

2 0 insect cell, and E. coli cells using vectors described elsewhere in this application. 



Example 3 Sequence Comparisons 

Sequence alignments between ACAT sequences from several different sources are 
compared to identify the similarity between the sequences. Nucleotide sequences from 

2 5 known human and mouse ACATs, as well as nucleotide sequences from known yeast 

ACATs are compared to the ACAT-like EST sequences from human and Arabidopsis. 

Analysis of the sequence alignments reveals several classes of ACATs based on 
sequence similarity. The known human and mouse ACATs, being 88% similar in the 
nucleotide sequence, form one class of ACATs. Another class of ACATs includes the 

3 0 yeast ACATs which are less than 20% similar to the known human and mouse class 

ACATs. 

The final class of ACATs includes the Arabidopsis (Figure 1) ) (SEQ ID NO:l) 
and human (Figure 7) ) (SEQ ID NO: 14) sequences disclosed in the present invention. 
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This class is approximately 22% similar to the known human and mouse ACAT class and 
approximately 23% similar to the yeast class of ACATs. Thus, the ACAT sequences 
disclosed in the present invention represent a novel class of ACAT enzymes. Partial 
mouse sequences of this class are also provided (Figs. 5 and 6). 
5 The cDNA obtained coding for the entire rat ACAT-like protein is 1766 

nucleotides in length (Figure 10). The protein coded by the DNA sequence is 500 amino 
acids in length (Figure 11), and its molecular weight is 57 kDa. The reading frame is 
open upstream (5') of the methionine to the 5' terminus of the cDNA, thus it is possible 
that the protein is larger than that predicted by the cDNA sequence. When the rat ACAT- 

10 like protein sequence is aligned with the amino acid sequences of the human and mouse 
sequences, it is found that the rat sequence is 20% identical, and 34% similar. However, 
when the rat sequence is compared to the Arabidopsis ACAT-like protein sequence, the 
rat sequence is 30% identical and 44% similar. 

Furthermore, sequence comparisons between the rat and Arabidopsis ACAT-like 

15 amino acid sequences and the known human and mouse ACAT sequences (Figure 17) 
shows a peptide sequence, GAAAQNTVSYP, unique to the rat ACAT-like protein. 

Example 4 Expression constructs 

4A. Baculovirus Expression Constructs 

20 Constructs are prepared to direct the expression of the Arabidopsis ACAT-like 

sequence in cultured insect cells. The entire coding region of the Arabidopsis ACAT-like 
sequence is amplified from the EST clone LIB25-088-C7 using oligonucleotide primers 
5'- TCGACCTGCAGGAAGCTTAGAAATGGCGATTTTGGATTC-3' ) (SEQ ID 
NO:34)and 5'- GGATCCGCGGCCGCTCATGACATCGATCCTTTTCGG-3' ) (SEQ ID 

25 NO:35) in a polymerase chain reaction (PCR). The PCR product was subcloned into 
pCR2. 1 (Invitrogen). Double stranded DNA sequence was obtained to verify that no 
errors were introduced by PCR amplification. The resulting plasmid was designated 
pCGN8626. 

pCGN8626 was digested with NotI and the 5' overhang was filled in with Klenow 
30 fragment. The plasmid was further digested with Sse8387I and a fragment containing the 
ACAT homologue coding region was purified by gel electrophoresis. The fragment 
containing the entire coding region of the Arabidopsis ACAT-like sequence was 
subcloned into baculovirus expression vector pFastBacl (Gibco-BRU Gaithersburg, MD) 
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that had been digested with HindllL blunt-ended by filling in the 5' overhand with 
Klenow fragment and then digesting with Pstl. The resulting plasmid was designated 
pCGN863 1 . DNA sequence analysis confirmed the integrity of the cloning junctions. 
4B. Plant Expression Construct Preparation 
5 A plasmid containing the napin cassette derived from pCGN3223 (described in 

USPN 5,639,790, the entirety of which is incorporated herein by reference) was modified 
to make it more useful for cloning large DNA fragments containing multiple restriction 
sites, and to allow the cloning of multiple napin fusion genes into plant binary 
transformation vectors. An adapter comprised of the self annealed oligonucleotide of 
10 sequence 

CGCGATTTAAATGGCGCGCCCTGCAGGCGGCCGCCTGCAGGGCGCGCCATTT 
AA (SEQ ID NO:36) AT was ligated into the cloning vector pBC SK+ (Stratagene) after 
digestion with the restriction endonuclease BssHII to construct vector pCGN7765. 
Plamids pCGN3223 and pCGN7765 were digested with NotI and ligated together. The 
15 resultant vector, pCGN7770, contains the pCGN7765 backbone with the napin seed 
specific expression cassette from pCGN3223. 

The cloning cassette, pCGN7787, essentially the same regulatory elements as 
pCGN7770, with the exception of the napin regulatory regions of pCGN7770 have been 
replaced with the double CAMV 35S promoter and the tml polyadenylation and 

2 0 transcriptional termination region. 

A binary vector for plant transformation, pCGN5139, was constructed from 
pCGN1558 (McBride and Summerfelt, (1990) Plant Molecular Biology, 14:269-276). 
The polylinker of pCGN1558 was replaced as a HindIII/Asp7 1 8 fragment with a 
polylinker containing unique restriction endonuclease sites, AscI, Pad, Xbal, Swal, 
25 BamHI,and NotI. The Asp718 and Hindlll restriction endonuclease sites are retained in 
pCGN5139. 

A series of turbo binary vectors are constructed to allow for the rapid cloning of 
DNA sequences into binary vectors containing transcriptional initiation regions 
(promoters) and transcriptional termination regions. 

3 0 The plasmid pCGN8618 was constructed by ligating oligonucleotides 5'- 

TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3 , ) (SEQ ID NO:37) and 5'- 
TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC-3' ) (SEQ ID NO:38) into 
Sall/XhoI-digested pCGN7770. A fragment containing the napin promoter, polylinker 
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and napin 3' region was excised from pCGN8618 by digestion with Asp718I; the 
fragment was blunt-ended by filling in the 5' overhangs withKlenow fragment then 
ligated into pCGN5139 that had been digested with Asp718I and Hindlll and blunt-ended 
by filling in the 5' overhangs with KJenow fragment. A plasmid containing the insert 
5 oriented so that the napin promoter was closest to the blunted Asp7 181 site of pCGN5 139 
and the napin 3 ? was closest to the blunted Hindlll site was subjected to sequence analysis 
to confirm both the insert orientation and the integrity of cloning junctions. The resulting 
plasmid was designated pCGN8622. 

The plasmid pCGN8619 was constructed by ligating oligonucleotides 5'- 

10 TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC -3' ) (SEQ ID NO:39) and 5*- 
TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3' ) (SEQ ID NO:40) into 
Sall/XhoI-digested pCGN7770. A fragment containing the napin promoter, polylinker 
and napin 3' region was removed from pCGN8619 by digestion with Asp718I; the 
fragment was blunt-ended by filling in the 5* overhangs with Klenow fragment then 

15 ligated into pCGN5139 that had been digested with Asp718I and Hindlll and blunt-ended 
by filling in the 5' overhangs with Klenow fragment. A plasmid containing the insert 
oriented so that the napin promoter was closest to the blunted Asp718I site of pCGN5139 
and the napin 3* was closest to the blunted Hindlll site was subjected to sequence analysis 
to confirm both the insert orientation and the integrity of cloning junctions. The resulting 

2 0 plasmid was designated pCGN8623. 

The plasmid pCGN8620 was constructed by ligating oligonucleotides 5*- 
TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGGAGCT -3* ) (SEQ ID NO:41) and 
5 ' -CCTGC AGG A AGCTTGCGGCCGCGG ATCC-3 ' ) (SEQ ID NO:42) into Sall/Sacl- 
digested pCGN7787. A fragment containing the d35S promoter, polylinker and tml 3' 

2 5 region was removed from pCGN8620 by complete digestion with Asp7 181 and partial 

digestion with Notl. The fragment was blunt-ended by filling in the 5' overhangs with 
Klenow fragment then ligated into pCGN5139 that had been digested with Asp718I and 
Hindlll and blunt-ended by filling in the 5' overhangs withKlenow fragment. A plasmid 
containing the insert oriented so that the d35S promoter was closest to the blunted 

3 0 Asp718I site of pCGN5139 and the tml 3' was closest to the blunted Hindlll site was 

subjected to sequence analysis to confirm both the insert orientation and the integrity of 
cloning junctions. The resulting plasmid was designated pCGN8624. 
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The plasmid pCGN8621 was constructed by ligating oligonucleotides 5'- 
TCGACCTGCAGGAAGCTTGCGGCCGCGGATCCAGCT -3' ) (SEQ ID NO:43) and 
5 ' -GGATCCGCGGCCGC A AGCTTCCTGC AGG-3 * ) (SEQ ID NO:44) into Sall/SacI- 
digested pCGN7787. A fragment containing the d35S promoter, polylinker and tml 3' 
5 region was removed from pCGN8621 by complete digestion with Asp718I and partial 
digestion with Notl. The fragment was blunt-ended by filling in the 5' overhangs with 
Klenow fragment then ligated into pCGN5139 that had been digested with Asp718I and 
Hindlll and blunt-ended by filling in the 5' overhangs with Klenow fragment. A plasmid 
containing the insert oriented so that the d35S promoter was closest to the blunted 
10 Asp718I site of pCGN5 139 and the tml 3' was closest to the blunted Hindlll site was 

subjected to sequence analysis to confirm both the insert orientation and the integrity of 
cloning junctions. The resulting plasmid was designated pCGN8625. 

A fragment containing the Arabidopsis ACAT-like coding region was removed 
from pCGN8626 by digestion with Sse8387I and Not I. The fragment containing the 
15 ACAT-like sequence was ligated into Pstl-Not I digested pCGN8622. The resulting 

plasmid was designated pCGN8627. DNA sequence analysis confirmed the integrity of 
the cloning junctions. 

A fragment containing the Arabidopsis ACAT-like coding region was removed 
from pCGN8626 by digestion with Sse8387I and Not L The fragment was ligated into 
2 0 Pstl-Not I digested pCGN8623. The resulting plasmid was designated pCGN8628. DNA 
sequence analysis confirmed the integrity of the cloning junctions. 

A fragment containing the Arabidopsis ACAT-like coding region was removed 
from pCGN8626 by digestion with Sse8387 and Not I. The fragment was ligated into 
Pstl-Not I digested pCGN8624. The resulting plasmid was designated pCGN8629. DNA 

2 5 sequence analysis confirmed the integrity of the cloning junctions. 

A fragment containing the Arabidopsis ACAT-like coding region was removed 
from pCGN8626 by digestion with Sse8387 and Not I. The fragment was ligated into 
Pstl-Not I digested pCGN8625. The resulting plasmid was designated pCGN8630. DNA 
sequence analysis confirmed the integrity of the cloning junctions. 

3 0 An additional expression construct for the suppression of endogenous ACAT-like 

activity was also prepared. The construct pCGN8660 was constructed by cloning 
approximately 1 Kb of the Arabidopsis ACAT-like coding region from pCGN8626 in the 
sense orientation, and the full-length Arabidopsis ACAT-like coding region in the 
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antisense orientation under the regulatory control of the napin transcription initiation 
sequence. 

For expression of the rat ACAT-like sequence in plants, the NotI-Sse8387I 
fragment of pCGN8592 was cloned into Notl-PstI digested binary vectors pCGN8621, 
5 pCGN8622, and pCGN8624 to yield plasmids, pCGN 9700, pCGN9701 , and pCGN9702, 
respectively. Plasmid pCGN9700 expresses a sense transcript of the rat ACAT-like 
cDNA under control of a napin promoter, plasmid pCGN9701 expresses an antisense 
transcript of the rat ACAT-like cDNA under control of a napin promoter, and plasmid 
pCGN9702 expresses a sense transcript of the rat ACAT-like cDN A under control of a 

10 double 35S promoter. Plasmids pCGN 9700, pCGN9701, and pCGN9702 are introduced 
in Agrobacterium tumefaciens EH A 10 1 . 

Constructs are prepared to direct the expression of the rat ACAT-like sequence in 
the seed embryo of soybean and the endosperm of corn. For expression of the rat ACAT- 
like DNA sequence in soybean, a 1.5 kb NotVSse83&7\ fragment from pCGN8592 

15 containing the coding sequence of the rat ACAT-like sequence was blunt ended using 

Mung bean nuclease, and ligated into the Smal site of the turbo 7S binary/cloning vector 
pCGN8809 to create the vector pCGN8817 (Figure 15) for transformation into soybean 
by panicle bombardment. The vector pCGN8817 contains the operably linked 
components of the promoter region of the soybean a* subunit of p-conglycinin (7S 

20 promoter, (Chen et al, ( 1986), Proc. NatL Acad. 5c/.. 83:8560-8564), the DNA sequence 
coding for the entire rat ACAT-like protein, and the transcriptional termination region of 
pea RuBisCo small subunit, referred to as E9 3' (Coruzzi, et al (1984) EMBO 7. 3:1671- 
1679 and Morelli, et al (1985) Nature 315:200-204). This construct further contains 
sequences for the selection of positive transformed plants by screening for resistance to 

2 5 glyphosate using the CP4 EPSPS (U.S. Patent 5,633,435) expressed under the control of 

the figwort mosaic virus (FMV) promoter (U.S. Patent Number 5,378,619) and the 
transcriptional termination region of E9. 

For expression of the rat ACAT-like sequence in the corn endosperm, a 1.5 kb 
NotVSse%3%ll fragment from pCGN8592 containing the coding sequence of the rat 

3 0 ACAT-like sequence was blunt ended using Mung bean nuclease, and ligated into the 

BamHl site of the rice pGtl expression cassette pCGN8592 for expression from the pGtl 
promoter (Leisy, DJ. et al.. Plant Mol. Biol. 14 (1989) 41-50) and the HSP70 intron 
sequence (U.S. Patent Number 5.593,874). This cassette also includes the transcriptional 
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termination region downstream of the cloning site of nopaline synthase, nos 3' (Depicker 
et al., J. Molec. Appl. Genet, (1982) 1: 562-573). A 7.5 kb fragment containing the pGtl 
promoter, the DNA sequence encoding the rat ACAT-like protein, and the nos 
transcriptional termination sequence is cloned into the binary vector pCGN88 16 to create 
5 the vector pCGN8818 (Figure 16) for transformation into corn. This construct also 

contains sequences for the selection of positive transformants with kanamycin using the 
kanamycin resistance gene from Tn5 bacteria under the control of theCAMV 35S 
promoter and tml transcriptional termination regions. 

10 Examples Expression of AC ATs in Insect Cell Culture 

A baculovirus expression system is used to express the full length rat and 
Arabidopsis ACAT-like cDNA in cultured insect cells. 

The baculovirus expression construct pCGN863 i is transformed and expressed 
using the BAC-to-BAC Baculovirus Expression System (Gibco-BRL, Gaithersburg, MD) 
15 according to the maufacturers directions, except harvesting of recombinant viruses was 
done 5 days post-transfection. The supernatant from the transfection mixture is used for 
generating virus stock which in turn is used for infecting Sf9 cells for use in the assay. 

The transformed insect cells may be assayed for ACAT, DAGAT or other 
acyltransferase activities using methods described herein. Insect cells are centrifuged and 
2 0 the resulting pelletted cells were resuspended in Medium I (0.25 M sucrose and 1 mM 
EDTA in 10 mM Tris-Cl, pH 7.4) and homogenized on ice. The homogenate is 
centrifuged at 105,000 x g for 1 at 4° C. Total membranes are resuspended in Medium I. 
DAGAT activity is assayed in a 0.2 ml reaction mixture containing 175 mM Tris, pH 8, 1 
mg/ml bovine serum albumin, 8 mM MgC12, 0.2 mM 1,2-diolein in acetone, 20 mM 3H- 

2 5 paimitoyl-CoA, and 0.5-30 mg of membrane proteins. The final concentration of acetone 

is 10%. In some assays, to measure DAGAT activity under optimal conditions for plant 
enzyme, the 0.2 ml reaction mixture contained 0.1 M NaCl, 0. 1 % Triton X 100, 0.5 mM 
1,2-diolein, 10 mM EDTA, 0.1 M Tris, pH 7.8, and 20 mM 3H-palmitoyl-CoA. The 10 
min reaction is terminated by addition of 1.5 ml solution of isopropanol: heptane: water 

3 0 (80:20:2, v/v). The lipids are extracted and analyzed by thin layer chromatography as 

described (Coleman R.A. (1992) Methods. Enzymol. 209, 98-104). 
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The results of the DAG AT activity assay demonstrates that the RAT ACAT 
related DNA sequence encodes a protein with DAGAT activity about 80 fold greater than 
the controls. 

5 Example 6 E. coii expression of the rat DAGAT 

An E. coli expression vector was constructed by annealing the oligonucleotide 
primers 5 ' -C AGGAGGCGGCCGC AGG AGGCTGC AGGT AC ) (SEQ ID NO:45) and 
5' CCTGCAGCCTCCTGCGGCCGCCTCCTGAGCT ) (SEQ ID NO:46) to make a 
synthetic adapter. The adapter was ligated to plasmid pBC SK+ (Stratagene) after 

10 digesting the plasmid with the restriction endonucleases, SstI and Kpnl. The resultant 
plasmid was named pCGN9909. Plasmid pCGN9909 was digested with NotI and PstI 
and ligated with the NotI-Sse8387I fragment from pCGN8592. The resultant plasmid, 
containing the rat DAGAT ORF under control of the E. coli Lac promoter, was designated 
pCGN9720. Five ml cultures E. coli cultures of pCGN9909 and pCGN9720 were grown 

15 overnight at 30 degrees centigrade to stationary stage in ECLB + lOOug/ml ampicillin. 
The 5 ml cultures were added to 50 ml of ECLB + 0. 1 mM IPTG and 100 ug/ml 
ampicillin, and the cultures were grown for 4 hours at 30 degrees. The cells were 
pelleted, resuspended in 2 ml isopropanol. and incubated at 75 degrees centigrade for 30 
minutes. The solution was cooled to room temperature and 3 ml of hexane were added. 

2 0 The cells were incubated with shaking for 1 hour, 3 ml of 6.6% sodium sulfite was added, 
the solution was vortexed. and the upper organic phase was removed to a clean test tube. 
The solvent was evaporated under a stream of nitrogen gas, and the lipids were resolved 
by TLC on silica G. The TLC plate was developed in hexane:diethyl ethenacetic acid 
(75:25: 1). The lipids were visualized by staining with iodine. Triglycerides were 

2 5 observed in the E. coli transformed with pCGN9720, but not in the E. coli transformed 

with the empty vector pCGN9909 (Figure 1 8). This suggests that the rat DAGAT can be 
used to produce triglycerides in microbes and other cells which otherwise would produce 
little or not triglyceride. 

3 0 Example 7 Plant Transformation 

A variety of methods have been developed to insert a DNA sequence of interest into 
the genome of a plant host to obtain the transcription or transcription and translation of the 
sequence to effect phenotypic changes. 
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The plant binary constructs pCGN8627, pCGN8628, pCGN8629 and pCGN8630 are 
used in plant transformation to direct the expression of the Arabidopsis ACAT-like sequence 
from plant tissues. 

Transgenic Brassica plants are obtained by Agrobacterium-mediatcd 
5 transformation as described by Radke et al. (Theor. Appl. Genet, (1988) 75:685-694; 
Plant Cell Reports (1992) 77:499-505). Transgenic Arabidopsis thaliana plants may be 
obtained by Agrobacterium-mtdiat&d transformation as described by Valverkens et ai, 
{Proc. Nat. Acad. ScL (1988) 55:5536-5540), or as described by Bent et al. ((1994), 
Science 265:1856-1860), or Bechtold et al. ((1993), C.R.Acad.Sci, Life Sciences 
10 316:1 194-1 199). Other plant species may be similarly transformed using related 
techniques. 

Alternatively, microprojectile bombardment methods, such as described by Klein 
et al. (Bio/Technology 70:286-291) may also be used to obtain nuclear transformed plants. 

15 Example 8 Analysis of Transgenic Plants 

Transgenic plants expressing proteins derived from the novel ACAT-like 
sequences are analyzed using techniques known in the art for sterol esterification activity 
and/or TAG synthesis activity. Enzyme assays are used to determine the enzyme activity 
of the ACAT-like translational product in plants transformed with pCGN8629, and plants 
2 0 transformed with pCGN8630. Leaf extracts are analyzed by thin layer chromatography to 
determine glycerolipid composition of the leaf lipids as well as sterol content. Seed 
extracts of the control plants, plants transformed with pCGN8627, and plants transformed 
with pCGN8628 are analyzed for alterations in the levels of diacylglycerol, 
triacylglycerol, or phospholipids as well as modifications of the sterol levels. 

2 5 A single leaf was removed from each of 10-20 Arabidopsis plants. The leaves 

were weighed, and leaf lipids were extracted in 4 ml hexane:isopropanol (3:2). 2 ml of 
6.6% sodium sulfite was added, the solution was vortexed, and the upper organic phase 
was removed to a fresh test tube. The solvent was dried under nitrogen gas, and the leaf 
neutral lipids were resuspended in 50 ul of hexane. The leaf lipids were resolved by TLC 

3 0 on Silica G TLC plates developed in hexane: diethyl ethenacetic acid (75:25:1). After 

development, the lipids were visualized by staining with primulin (0.01% in 80% acetone) 
and viewing under long wave (350 nM) UV illumination. The lipids that migrated the 
same distance as a triglyceride standard were transmethylated in acidic methanol (8.5% 
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H2S04 in methanol for 2 hours) with 62.5 ug of a tri-17:0 triglyceride standard, and the 
fatty acid methyl esters (FAMES)were resolve by gas chromatography. The quantity of 
triglycerides was determined by comparing the peak area of the leaf triglyceride FAMEs 
with the area of the 17:0 internal standard. Leaves from untransformed control plants 
5 contained 0.041 ug TAG/mg fresh weight, while the leaves from pCGN9702 plants 

contained 0.49 ug TAG/mg fresh weight. Thus, expression of the rat ACAT-like cDNA 
in transgenic leaf tissue directed more than a 10 fold increase in the quantity of TAG in 
leaf tissue. 

10 The above results demonstrate that the ACAT-like nucleic acid sequences 

identified in the present invention encode for proteins active in the formation 
triacylglycerol. Such nucleic acid sequences may be used in constructs to provide for the 
expression of the ACAT-like protein in host cells. Furthermore, such expression 
constructs may be employed in methods for modifying triacylglycerol content of host cells 

15 and organisms. 

All publications and patent applications mentioned in this specification are 
indicative of the level of skill of those skilled in the art to which this invention pertains. 
All publications and patent applications are herein incorporated by reference to the same 
extent as if each individual publication or patent application was specifically and 

2 0 individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be obvious that 
certain changes and modifications may be practiced within the scope of the appended 
claim. 



40 



WO 99/63096 PCT/US99/12S41 

Claims 

What is claimed is: 

1. An isolated DNA sequence encoding an enzyme of the class of acyl- 
CoA:cholesterol acyltransferase-like proteins. 

2. The DNA encoding sequence of Claim 1 wherein said DNA sequence is 
isolated from a plant. 

3. The DNA encoding sequence of Claim 2 wherein said plant is selected from 
the group consisting of Arabidopsis thaliana, soybean, or corn. 

4. The DNA encoding sequence of Claim 3 wherein said acyl-CoA:cholesterol 
acyltransferase-like protein is encoded by the sequence of Figure 1 . 

5. The DNA encoding sequence of Claim 3 wherein said acyl-CoA:cholesterol 
acyltransferase-like protein is encoded by a sequence which includes the EST of Figure 2. 

6. The DNA encoding sequence of Claim 3 wherein said acyl-CoA:cholesterol 
acyltransferase-like protein is encoded by a sequence which includes the EST of Figure 3. 

7. The DNA encoding sequence of Claim 1 wherein said DNA encoding sequence 
is isolated from a non-plant source. 

8. The DNA encoding sequence of Claim 7 wherein said non-plant source is 
selected from the group consisting of rat, human, mouse, Mortierella, or Caenorhabditis 
elegans. 

9. The DNA encoding sequence of Claim 8 wherein said acyl-CoA:cholesterol 
acyltransferase-like protein is encoded by the sequence of Figure 10. 

10. The DNA encoding sequence of Claim 8 wherein said acyl-CoAxholesterol 
acyltransferase-like protein is encoded by the sequence of Figure 7. 

11. The DNA encoding sequence of Claim 8, wherein said acyl-CoAxholesterol 
acyltransferase-like protein is encoded by the sequence of Figure 5 or 6. 
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12. The DNA encoding sequence of Claim 8, wherein said acyl-CoA:cholesterol 
acyltransferase-like protein is encoded by the sequence comprising the EST sequence of 
Figure 4. 

13. The amino acid sequence of Claim 8, wherein said acyl-CoAxholesterol 
5 acyltransferase-like protein has the amino acid sequence of Figure 12. 

14. A recombinant DNA construct comprising any of the DNA encoding 

sequences of Claims 1-13. 

15. The recombinant DNA construct according to Claim 14, wherein said acyl- 
CoAxholesterol acyltransferase-like protein has diacylglycerol acyltransferase activity 

10 16. The DNA construct of Claim 14 including a transcriptional initiation region 

functional in plant cells. 

17. A plant cell comprising a DNA construct of 14. 

18. A plant comprising a cell of Claim 17. 

19. A method of modifying the lipid composition in a plant cell, said method 
15 comprising: 

transforming a plant cell a construct of Claim 14 and, 

growing said cell under conditions wherein transcription of said ACAT-like 
sequence is initiated, 

whereby said lipid composition is modified. 
2 0 20. A method according to Claim 19 wherein said acyl-CoAxholesterol 

acyltransferase-like protein encoding sequence is in an antisense orientation, whereby 
transcribed mRNA from said sequence is complementary to the equivalent mRNA 
transcribed from the endogenous gene, whereby the activity of said acyl-CoAxholesterol 
acyltransferase-like protein in said plant cell is suppressed. 
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21. A method according to Claim 20, wherein said acyi-CoA:cholesterol 
acyltransferase-like protein has diacylglycerol acyltransferase activity whereby the 
synthesis of triglycerides is suppressed in said plant cell. 

22 . A method according to Claim 19, wherein said acyl-CoA:cholesterol 
5 acyltransferase-like protein encoding sequence is in a sense orientation. 

23 . A method according to Claim 22, wherein said acyl-CoA:cholesterol 
acyltransferase-like protein has diacylglycerol acyltransferase activity whereby the 
synthesis of triglycerides is increased in said plant cell. 

24. A plant cell modified according to the method of Claim 19, wherein said plant cell is 
10 selected from the group consisting of Brassica, corn, soybean, safflower, alfalfa, and sunflower 

cells. 

25. A method of modifying the lipid composition in a host cell, said method 
comprising: 

transforming a host cell with DNA comprising as operably associated components 
15 in the 5' to 3' direction of transcription, a transcriptional initiation region functional in 
said host cell, a DNA sequence encoding an acyl-CoA:cholesterol acyltransferase-like 
protein, and a transcription termination sequence, and 

growing said cell under conditions wherein transcription of said ACAT-like 
sequence is initiated, 
2 0 whereby said lipid composition is modified. 

2 6 . The method of Claim 25 wherein said host cell is a prokaryotic cell. 

27 . The method according to Claim 26, wherein said prokaryotic cell is an£. 
coli cell. 

2 8 . The method of Claim 25 wherein said host cell is a eukaryotic cell. 
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2 9 . The method according to Claim 28, wherein said eukaryotic cell is selected 



from the group consisting of plant, fungal, or algal cells. 



30. 



The method of Claim 29, wherein said plant cell is a leaf cell. 



31 . 



The method of Claim 29, wherein said plant cell is a seed cell. 



5 



32 . 



The method of Claim 31, wherein said seed cell is a seed embryo cell. 



3 3 . The method of Claim 3 1 , wherein said seed cell is a seed endosperm cell. 

3 4 . The method of Claim 25, wherein said transcriptional initiation region is 
from a gene preferentially expressed in plant embryo tissue. 

3 5 . The method of Claim 25, wherein said DNA encoding sequence codes for a 
10 protein having diacylglycerol acyltransferase activity. 

3 6 . The method according to Claim 25 further comprising harvesting mature 
plant seed and separating a modified oil from the meal of said plant seed. 

37. A plant oil produced according to the method of Claim 36 comprising a modified 
triglyceride content. 

15. 38. A plant oil produced according to the method of Claim 37, wherein said triglyceride 

content is increased. 

39. A plant oil produced according to the method of Claim 38, wherein said increase in 
triglyceride content comprises an increase in the percent of triglyceride per fresh weight of tissue. 
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CTCTCGTGAATCCTTTTTCCTTTCTTCTTCTTCTTCTCTTCAGAGAAAACTTTGCTTCTC 
TTTCTATAAGGAACCAGACACGAATCCCATTCCCACCGATTTCTTAGCTTCTTCCTTCAA 
TCCGCTCTTTCCCTCTCCATTAGATTCTGTTTCCTCTTTCAATTTCTTCTGCATGCTTCT 
CGATTCTCTCTGACGCCTCTTTTCTCCCGACGCTGTTTCGTCAAACGCTTTTCGAAATGG 
CGATTTTGGATTCTGCTGGCGTTACTACGGTGACGGAGAACGGTGGCGGAGAGTTCGTCG 
ATCTTGATAGGCTTCGTCGACGGAAATCGAGATCGGATTCTTCTAACGGACTTCTTCTCT 
CTGGTTCCGATAATAATTCTCCTTCGGATGATGTTGGAGCTCCCGCCGACGTTAGGGATC 
GGATTGATTCCGTTGTTAACGATGACGCTCAGGGAACAGCCAATTTGGCCGGAGATAATA 
ACGGTGGTGGCGATAATAACGGTGGTGGAAGAGGCGGCGGAGAAGGAAGAGGAAACGCCG 
ATGCTACGTTTACGTATCGACCGTCGGTTCCAGCTCATCGGAGGGCGAGAGAGAGTCCAC 
TTAGCTCCGACGCAATCTTCAAACAGAGCCATGCCGGATTATTCAACCTCTGTGTAGTAG 
TTCTTATTGCTGTAAACAGTAGACTCATCATCGAAAATCTTATGAAGTATGGTTGGTTGA 
TCAGAACGGATTTCTGGTTTAGTTCAAGATCGCTGCGAGATTGGCCGCTTTTCATGTGTT 
GTATATCCCTTTCGATCTTTCCTTTGGCTGCCTTTACGGTTGAGAAATTGGTACTTCAGA 
AATACATATCAGAACCTGTTGTCATCTTTCTTCATATTATTATCACCATGACAGAGGTTT 
TGTATCCAGTTTACGTCACCCTAAGGTGTGATTCTGCTTTTTTATCAGGTGTCACTTTGA 
TGCTCCTCACTTGCATTGTGTGGCTAAAGTTGGTTTCTTATGCTCATACTAGCTATGACA 
TAAGATCCCTAGCCAATGCAGCTGATAAGGCCAATCCTGAAGTCTCCTACTACGTTAGCT 
TGAAGAGCTTGGCATATTTCATGGTCGCTCCCACATTGTGTTATCAGCCAAGTTATCCAC 
GTTCTGCATGTATACGGAAGGGTTGGGTGGCTCGTCAATTTGCAAAACTGGTCATATTCA 
CCGGATTCATGGGATTTATAATAGAACAATATATAAATCCTATTGTCAGGAACTCAAAGC 
ATCCTTTGAAAGGCGATCTTCTATATGCTATTGAAAGAGTGTTGAAGCTTTCAGTTCCAA 
ATTTATATGTGTGGCTCTGCATGTTCTACTGCTTCTTCCACCTTTGGTTAAACATATTGG 
CAGAGCTTCTCTGCTTCGGGGATCGTGAATTCTACAAAGATTGGTGGAATGCAAAAAGTG 
TGGGAGATTACTGGAGAATGTGGAATATGCCTGTTCATAAATGGATGGTTCGACATATAT 
ACTTCCCGTGCTTGCGCAGCAAGATACCAAAGACACTCGCCATTATCATTGCTTTCCTAG 
TCTCTGCAGTCTTTCATGAGCTATGCATCGCAGTTCCTTGTCGTCTCTTCAAGCTATGGG 
CTTTTGTTGGGATTATGTTTCAGGTGCCTTTGGTCTTCATCACAAACTATCTACAGGAAA 
GGTTTGGCTCAACGGTGGGGAACATGATCTTCTGGTTCATCTTCTGCATTTTCGGACAAC 
CGATGTGTGTGCTTCTTTATTACCACGACCTGATGAACCGAAAAGGATCGATGTCATGAA 
ACAACTGTTCAAAAAATGACTTTCTTCAAACATCTATGGCCTCGTTGGATCTCCGTTGAT 
GTTGTGGTGGTTCTGATGCTAAAACGACAAATAGTGTTATAACCATTGAAGAAGAAAAGA 
CAATTAGAGTTGTTGTATCGGA 

>protein 

iy[AILDSAGVTTVTENGGGEFVDLDRLRRRKSRSDSSNGLLLSGSDNNSPSDDVGAPADVR 
DRIDSVVNDDAQGTANLAGDNNGGGDNNGGGRGGGEGRGNADATFTYRPSVPAHRRARES 
PLSSDAIFKQSHAGLFNLCVWLIAVNSRLIIENLMKYGWLIRTDFWFSSRSLRDWPLFM 
CC I SLS IFPLAAFTVEKLVLQKYI SEPWIFLHI I ITMTEVLYPVYVTLRCDSAFLSGVT 
LMLLTCIWLKLVSYAHTSYDIRSLANAADKANPEVSYYVSLKSLAYFMVAPTLCYQPSY 
PRSACIRKGWVARQFAKLVIFTGFMGFIIEQYINPIVRNSKHPLKGDLLYAIERVLKLSV 
PNLYWLCMFYCFFHLWLNILAELLCFGDREFYKDWW^^ 

IYFPCLRSKIPKTLAIIIAFLVSAVFHELCIAVPCRLFKLWAFLGIMFQVPLVFITNYLQ 
ERFG STVGNMI FWF I FC I FGQ PMC VLL YYHDLMNRKGSMS 

Figure 1 
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GAGNNNNGNAAC^ 

TTANACTNAATTNGAAAATNCISn^JNCCCAACTTNAGGNACTTNNAGNCCCCCCNACTTGAC 
AACGGAGCACTATATTTACCCCGTGGTNGTTCAACCCAGCCATCTCACCCTTGCGAGCAT 
TGGTGCTGCTCTTGATACCCTTCATGCTTAACTATCTCATGATCTTTTACATCATTTTCG 
AGTGCATCTGCAACGCCTTTGCGGAACTAAGTTGCTTTGCGGATCGCAACTTTTACGAGG 
ATTGGTGGAACTGCGTCAGCTTTGATGAGTGGGCACGCAAATGGAACAAGCCTGTGCAAC 
ACTTCTTGCTCCGCCACGTGTACGACTCGAGCATCCGAGTCCTTCCACTTGTCCGAAATC 
CAATGCCGCNAATTGCAAACGTTCCTTCCCGGTCGTCAATGCGTTCAACGAACCTGGGTG 
AAGAATGGGTGGTGACAACGTTAAAGTGCGCCCGGTATC 

Figure 4 
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TGGAGGACAACGCGGGGTCTGATACGACTCACTATAGGGAATTTGGCCCTCGAGCAGTAG 
ATTCGGCACGATGGGCACGAGGACTCCATCATGTTCCTCAAGCTTTATTCCTACCGGGAT 
GTCAACCTGTGGTGCCGCCAGCGAAGGGTCAAGGCCAAAGCTGTCTCTACAGGGAAGAAG 
GTCAGTGGGGCTGCTGCGAGCAAGCTGTGAGCTATCCAGACAACCTGACCTACCGAGATC 
TCGATTACTTCATCTTTGCTCCTACTTTGTGTTATGAACTCAACTTTCCTCGGTCCCCCC 
GAATACGAGAGCGCTTTCTGCTACGACGAGTTCTTGAGATGCTCTTTTTTACCCAGCTTC 
AAGTGGGGCTGATCCAACAGTGGATGGTCCCTACTATCCAGAACTCCATGGAAGCCCTTT 
CAAGAGCTTCTGCAGTTTTGGAGACCGCGAGTTCTACAGAGATTGGTGGAATGCTGAGTC 
TGTCACCGACTTTTGGCAGAACTGGAATATCCCCGTGG 

Figure 5 



BNSDOCID: <WO 9963096A2 I > 



WO 99/63096 



6 / 17 



PCT/US99/12541 



CCATGATGGCTCAGGTCCCACTGGCCTGGATTGTGGGCCGATTCTTCCAAGGGAACT 
ATGGCAATGCAGCTGTGTGGGTGACACTCATCATTGGGCAACCGGTGGCTGTCTCAT 
GTATGTCCACGACTACTACGTGCTCAACTACGATGCCCCAGTGGGTCATGAGCTACT 
GCCAAAGGCAGCCCTCCCTAACCTGGGCCTGGAGTTCTGGAGGGGTTCCTGGCTGCC 
TGCACACTCCTCCTAGTCTGGGAGGCCTCTCTGCCCCTATGCGCTACTCCTGCTCTT 

GGGGATGGCATTTG 

Figure 6 
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GTCTGGTGTGATGGGGACAGGGAGGGACTTCCCCTTACCCAGCACTGGTGTTGGCTG 
AGGTGGGTGCTGAGTCTCAGAGCTTGGCATGGAGACCAGACAGGGCTGGGTCTGCAA 
GCCTGAGGCTGCCGCCCTGAGCTCGGGCTGGGACGTGCCCAGAGGTGTTGGGAGGAT 
CTGGGGTGAGTACCCTGTGGCCAGGACTAAAGGGGCTNCACCCTCCTGTCCATCCCT 
CGCAGATCTTGAGCAATGCCCGGTTATTTCTGGAGAACCTCATCAAGTATGGCATCC 
TGGTGGACCCCATCCAGGTGGTTTCTCTGTTCCTGAAGGATCCCTATAGCTGGCCCG 
CCCCATGCCTGGTTATTGCGGCCAATGTCTTTGCTGTGGCTGCATTCCAGGTTGAGA 
AGCGCCTGGCGGTGGGTGCCCTGACGGAGCAGGCGGGACTGCTGCTGCACGTGGCCA 
ACCTGGCCACCATTCTGTGTTTCCCAGCGGCTGTGGTCTTACTGGTTGAGTCTATCA 
CTCCAGTGGGCTCCCTGCTGGCGCTGATGGCGCACACCATCCTCTTCCTCAAGCTCT 
TCTCCTACCGCGACGTCAACTCATGGTGCCGCAGGGCCAGGGCCAAGGCTGCCTCTG 
CAGGGAAGAAGGCCAGCAGTGCTGCTGCCCCGCACACCGTGAGCTACCCGGACAATC 
TGACCTACCGCGATCTCTACTACTTCCTCTTCGCCCCCACCTTGTGCTACGAGCTCA 
ACTTTCCCCGCTCTCCCCGCATCCGGAAGCGCTTTCTGCTGCGACGGATCCTTGAGA 
TGCTGTTCTTCACCCAGCTCCAGGTGGGGCTGATCCAGCAGTGGATGGTCCCCACCA 
TCCAGAACTCCATGAAGCCCTTCAAGGACATGGACTACTCACGCATCATCGAGCGCC 
TCCTGAAGCTGGCGGTCCCCAATCACCTCATCTGGCTCATCTTCTTCTACTGGCTCT 
TCCACTCCTGCCTGAATGCCGTGGCTGAGCTCATGCAGTTTGGAGACCGGGAGTTCT 
ACCGGGACTGGTGGAACTCCGAGTCTGTCACCTACTTCTGGCAGAACTGGAACATCC 
CTGTGCACAAGTGGTGCATCAGACACTTCTACAAGCCCATGCTTCGACGGGGCAGCA 
GCAAGTGGATGGCCAGGACAGGGGTGTTCCTGGCCTCGGCCTTCTTCCACGAGTACC 
TGGTGAGCGTCCCTCTGCGAATGTTCCGCCTCTGGGCGTTCACGGGCATGATGGCTC 
AGATCCCACTGGCCTGGTTCGTGGGCCGCTTTTTCCAGGGCAACTATGGCAACGCAG 
CTGTGTGGCTGTCGCTCATCATCGGACAGCCAATAGCCGTCCTCATGTACGTCCACG 
ACTACTACGTGCTCAACTATGAGGCCCCAGCGGCAGAGGCCTGAGCTGCACCTGAGG 
GCCTGGCTTCTCACTGCCACCTCACACCCGCTGCCAGAGCCCACCTCTCCTCCTAGG 
CCTCGAGTGCTGGGGATGGGCCTGGCTGCACAGCATCCTCCTCTGGTCCCAGGGAGG 
CCTCTCTGCCCCTATGGGGCTCTGTCCTGCACCCCTCAGGGATGGCGACAGCAGGCC 
AGACACAGTCTGATGCCAGCTGGGAGTCTTGCTGACCCTGCCCCGGGTCCGAGGGTG 
TCAATAAAGTGCTGTCCAGTGACCTCTTCAGCCTGCCAGGGGCCTGGGGCCTGGTGG 
GGGGTATGGCCACACCCACAAGGGCGAGTGCCAGAGCTGTGTGGACAGCTGTCCCAG 
GACCTGCCGGGGAGCAGCAGCTCCACTGCAGCAGGGCGGGCATGGCCGGTAGGGGGA 
GTGCAAGGCCAGGCAGACGCCCCCATTCCCCACACTCCCCTACCTAGAAAAGCTCAG 
CTCAGGCGTCCTCT 
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CACGACTGGGCCGCGACGTGGTGCGGGCCGAAGCCATGGGCGACCGCGGAGGCGCGGGAA 
GCTCTCGGCGTCGGAGGACCGGCTCGCGGGTTTCCATCCAGGGTGGTAGTGGGCCCATGG 
TAGACGAAGAGGAGGTGCGAGACGCCGCTGTGGGCCCCGACTTGGGCGCCGGGGGTGACG 
CTCCGGCTCCGGCTCCGGTTCCGGCTCCAGCCCACACCCGGGACAAAGACCGGCAGACCA 
GCGTGGGCGACGGCCACTGGGAGCTGAGGTGCCATCGTCTGCAAGACTCTTTGTTCAGCT 
CAGACAGCGGTTTCAGCAATTACCGTGGTATCCTGAATTGGTGCGTGGTGATGCTGATCC 
TGAGTAATGCAAGGTTATTTTTAGAGAATCTTATCAAGTATGGCATCCTGGTGGATCCCA 
TCCAGGTGGTGTCTCTGTTTCTGAAGGACCCCTACAGCTGGCCTGCCCCATGCTTGATCA 
TTGCATCCAATATCTTTATTGTGGCTACATTTCAGATTGAGAAGCGCCTGTCAGTGGGTG 
CCCTGACAGAGCAGATGGGGCTGCTGCTACATGTGGTTAACCTGGCCACAATTATCTGCT 
TCCCAGCAGCTGTGGCCTTACTGGTTGAGTCTATCACTCCAGTGGGTTCCCTGTTTGCTC 
TGGCATCATACTCCATCATCTTCCTCAAGCTTTTCTCCTACCGGGATGTCAATCTGTGGT 
GCCGCCAGCGAAGGGTCAAGGCCAAAGCTGTGTCTGCAGGGAAGAAGGTCAGTGGGGCTG 
CTGCCCAGAACACTGTAAGCTATCCGGACAACCTGACCTACCGAGATCTCTATTACTTCA 
TCTTTGCTCCTACTTTGTGTTATGAACTCAACTTTCCTCGATCCCCCCGAATACGAAAGC 
GCTTTCTGCTACGGCGGGTTCTTGAGATGCTCTTTTTCACCCAGCTTCAAGTGGGGCTGA 
TCCAGCAGTGGATGGTCCCTACTATCCAGAACTCCATGAAGCCCTTCAAGGACATGGACT 
ATTCACGAATCATTGAGCGTCTCTTAAAGCTGGCGGTCCCCAACCATCTGATATGGCTCA 
TCTTCTTCTATTGGCTTTTCCACTCATGTCTCAATGCTGTGGCAGAGCTCCTGCAGTTTG 
GAGACCGCGAGTTCTACAGGGACTGGTGGAATGCTGAGTCTGTCACCTACTTTTGGCAGA 
ACTGGAATATCCCCGTGCACAAGTGGTGCATCAGACACTTCTACAAGCCTATGCTCAGAC 
TGGGCAGCAACAAATGGATGGCCAGGACTGGGGTCTTTTTGGCGTCAGCCTTCTTCCATG 
AGTACCTAGTGAGCATTCCCCTGAGGATGTTCCGCCTCTGGGCATTCACAGCCATGATGG 
CTCAGGTCCCACTGGCCTGGATTGTGAACCGCTTCTTCCAAGGGAACTATGGCAATGCAG 
CTGTGTGGGTGACACTCATCATTGGGCAACCGGTGGCTGTGCTCATGTATGTCCACGACT 
ACTACGTGCTCAACTATGATGCCCCAGTGGGGGCCTGAGCTACTGCCAAAGGCCAGCCCT 
CCCTAACCTGGGCCTGGAGTTCTGGAGGGCTTCCTGGCTGCCTGCACACTCCTCCTAGTC 
TGGGAGGCCTCTCTGCCCCTATGGGGCCTACTCCTGCTCTTGGGGATGGCACCTGAGTCC 
AGCTGGTATGAGCCAGTGCTGGGAGTCTGTGCTGACCAGGGGCTGAGGATATCAATAAAG 
AGCTATCTAAAAAAAAAAAAAAAAAA 



Figure 8 
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CACGACTGGGCCGCGACGTGGTGCGGGCCGAAGCCATGGGCGACCGCGGAGGCGCGGGAA 
GCTCTCGGCGTCGGAGGACCGGCTCGCGGGTTTCCATCCAGGGTGGTAGTGGGCCCATGG 
TAGACGAAGAGGAGGTGCGAGACGCCGCTGTGGGCCCCGACTTGGGCGCCGGGGGTGACG 
CTCCGGCTCCGGCTCCGGTTCCGGCTCCAGCCCACACCCGGGACAAAGACCGGCAGACCA 
GCGTGGGCGACGGCCACTGGGAGCTGAGGTGCCATCGTCTGCAAGACTCTTTGTTCAGCT 
CAGACAGCGGTTTCAGCAATTACCGTGGTATCCTGAATTGGTGCGTGGTGATGCTGATCC 
TGAGTAATGCAAGGTTATTTTTAGAGAATCTTATCAAGTATGGCATCCTGGTGGATCCCA 
TCCAGGTGGTGTCTCTGTTTCTGAAGGACCCCTACAGCTGGCCTGCCCCATGCTTGATCA 
TTGCATCCAATATCTTTATTGTGGCTACATTTCAGATTGAGAAGCGCCTGTCAGTGGGTG 
CCCTGACAGAGCAGATGQGGCTGCTGCTACATGTGGTTAACCTGGCCACAATTATCTGCT 
TCCCAGCAGCTGTGGCCTTACTGGTTGAGTCTATCACTCCAGTGGGTTCCCTGTTTGCTC 
TGGCATCATACTCCATCATCTTCCTCAAGCTTTTCTCCTACCGGGATGTCAATCTGTGGT 
GCCGCCAGCGAAGGGTCAAGGCCAAAGCTGTGTCTGCAGGGAAGAAGGTCAGTGGGGCTG 
CTGCCCAGAACACTGTAAGCTATCCGGACAACCTGACCTACCGAGATCTCTATTACTTCA 
TCTTTGCTCCTACTTTGTGTTATGAACTCAACTTTCCTCGATCCCCCCGAATACGAAAGC 
GCTTTCTGCTACGGCGGGTTCTTGAGATGCTCTTTTTCACCCAGCTTCAAGTGGGGCTGA 
TCCAGCAGTGGATGGTCCCTACTATCCAGAACTCCATGAAGCCCTTCAAGGACATGGACT 
ATTCACGAATCATTGAGCGTCTCTTAAAGCTGGCGGTCCCCAACCATCTGATATGGCTCA 
TCTTCTTCTATTGGCTTTTCCACTCATGTCTCAATGCTGTGGCAGAGCTCCTGCAGTTTG 
GAGACCGCGAGTTCTACAGGGACTGGTGGAATGCTGAGTCTGTCACCTACTTTTGGCAGA 
ACTGGAATATCCCCGTGCACAAGTGGTGCATCAGACACTTCTACAAGCCTATGCTCAGAC 
TGGGCAGCAACAAATGGATGGCCAGGACTGGGGTCTTTTTGGCGTCAGCCTTCTTCCATG 
AGTACCTAGTGAGCATTCCCCTGAGGATGTTCCGCCTCTGGGCATTCACAGCCATGATGG 
CTCAGGTCCCACTGGCCTGGATTGTGAACCGCTTCTTCCAAGGGAACTATGGCAATGCAG 
CTGTGTGGGTGACACTCATCATTGGGCAACCGGTGGCTGTGCTCATGTATGTCCACGACT 
ACTACGTGCTCAACTATGATGCCCCAGTGGGGGCCTGAGCTACTGCCAAAGGCCAGCCCT 
CCCTAACCTGGGCCTGGAGTTCTGGAGGGCTTCCTGGCTGCCTGCACACTCCTCCTAGTC 
TGGGAGGCCTCTCTGCCCCTATGGGGCCTACTCCTGCTCTTGGGGATGGCACCTGAGTCC 
AGCTGGTATGAGCCAGTGCTGGGAGTCTGTGCTGACCAGGGGCTGAGGATATCAATAAAG 
AGC TATC TAAAAAAAAAAAAAAAAAA 
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MGDRGGAGSSRRRRTGSRVSIQGGSGPMVDEEEVRDAAVGPDLGAGGDAPAPAPVPAPAH 
TRDKDRQTSVGDGHWELRCHRLQDSLFSSDSGFSNYRGILNWCWMLILSNARLFLENLI 
KYGILVDPIQWSLFLKDPYSWPAPCLIIASNIFIVATFQIEKRLSVGALTEQMGLLLHV 

VNL AT I I C F PAAVAL LVE SIT PVGS L F AL AS YS I I F LKLF S YRDVNLWCRQRRVKAKAVS 
AGKKVSGAAAQNTVSYPDNLTYRDLYYFIFAPTLCYELNFPRSPRIRKRFLLRRVLEMLF 
FTQLQVGLIQQWMVPTIQNSMKPFKDMDYSRIIERLLKLAVPNHLIWLIFFYWLFHSCLN 
AVAELLQFGDREFYRDWWNAESVTYFWQNWNIPVHKWCIRHFYKPMLRLGSNKWMARTGV 
FLASAFFHEYLVS I PLRMFRLWAFTAMMAQVPLAWIVNRFFQGNYGNAAVWVTLI IGQPV 
AVLMYVHDYYVLNYDAPVGA 

Figure 11 
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MRQQTGRRRRQPSETSNGSLASSRRSSFAQNGNSSRKSSEMRGPCEKVVHT 

AQDSLFSTSSGWTNFRGFFNLSILLLVLSNGRVALENVIKYGILITPLQWISTFVE 

HHYSIWSWPNLALILCSNIQILSVFGMEKILERGWLGNGFAAVFYTSLVIAHLTIP 

VVVTLTHKWKNPLWSVVMMGVYVIEALKFISYGHVNYWARDARRKITELKTQV 

TDLAKKTCDPKQFWDLKDELSMHQMAAQYPANLTLSNIYYFMAAPTLCYEFKF 

PRLLRIRKHFLIKRTVELIFLSFLIAALVQQWVVPTVRNSMKPLSEMEYSRCLERL 

LKLAIPNHLIWLLFFYTFFHSFLNLIAELLRFADREFYRDFWNAETIGYFWKSWNI 

PVHRFAVRHIYSPMMRNNFSKMSAFFVVFFVSAFFHEYLVSVPLKIFRLWSYYG 

MMGQIPLSIITDKVVRGGRTGNIIVWLSLIVGQPLAILMYGHDWYILNFGVSAVQ 

NQTVGI 

Figure 12 
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TTTGATATGTACGGTAAATGGAAAAAAGGTATTCATGTATGGCAAGGTGGTA 
ATAA ATG GC ACT AA ATATGTTTC A AAAGTGTG AG C A AAC GTATGTG AG AG AC 
GAGAAAAATAAGAAAACGACCTGTAATACATGAAAAATATCAATAGGAATTT 
TGAGATAATTTGGCAACATGCAATATAATGATTATAATAAAAAACTTGTCTTA 
AGACTAGAGAACTGCTAATTCAAAAAAAACAAATTGAGATAAATCAAATACC 
AACG GTTTG GTTTTG AACTG CTG AA AC ACC A AAGTTC AA 



Figure 13 
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TGCAAATGCGTCAACAAACGGGACGACGGCGGCGTCAGCCTTCGGNAAAC 
ATCTAATGGTTCTTTGGCTTCCAGTAGACGCTCCTCATTTGCACAAAATGGT 
AATTCGTCAAGGGAAAAGTTCAGAAATGAGAGGACCTTGCGAGAAAGTGGT 
AC ATACTG CTC AAG ATTC ATTGTTTTC G ACG AGTTCTG G ATG G AC A AATTTC 
CGTGGATTCTTCAATTTGTCTATTTTACTTTTGGTACTTTCAAATGGACGCGT 
GGCACTTGAAAATGTGATCAAATATGGTATTTTGATAACACCCCTTCAGTGG 
ATCTCAACGTTTGTTGAGCATCACTACTCAATTTGGAGCTGGCCAAATCTTG 
CTCTC ATC CTATG CTC AAA 



Figure 14 
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BglH 13358 
Xbald 13465 
Xbal 13465 

Seal 13937 
SnaBl 14079 
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SEQUENCE LISTING 



<110> Lassner , Mike 

Ruezinsky, Diane 



<120> Acyl-CoA :Cholesterol Acyltransf erase Related Nucleic 
Ac i d Sequence s 



<130> 17045/00/WO 



<140> 
<141> 



<150> 60/088,143 
<151> 1998-06-05 



<150> 60/108,389 
<151> 1998-11-12 

<160> 46 



<170> Patentln Ver. 2.0 



<210> 1 
<211> 1942 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 1 

ctctcgtgaa tcctttttcc tttcttcttc ttcttctctt cagagaaaac tttgcttctc 60 
tttctataag gaaccagaca cgaatcccat tcccaccgat ttcttagctt cttccttcaa 120 
tccgctcttt ccctctccat tagattctgt ttcctctttc aatttcttct gcatgcttct 180 
cgattctctc tgacgcctct tttctcccga cgctgtttcg tcaaacgctt ttcgaaatgg 240 
cgattttgga ttctgctggc gctactacgg tgacggagaa cggtggcgga gagttcgtcg 3 00 
atcttgatag gcttcgtcga cggaaatcga gatcggattc ttctaacgga cttcttctct 3 60 
ctggttccga taataattct ccttcggatg atgttggagc tcccgccgac gttagggatc 420 
ggattgattc cgttgttzaac gatgacgctc agggaacagc caatttggcc ggagataata 480 
acggtggtgg cgataataac ggtggtggaa gaggcggcgg agaaggaaga ggaaacgccg 540 
atgctacgtt tacgtatcga ccgtcggttc cagctcatcg gagggcgaga gagagtccac 600 
ttagctccga cgcaatcttc aaacagagcc atgccggatt attcaacctc tgtgtagtag 660 
ttcttattgc tgtaaacagt agactcatca tcgaaaatct tatgaagtat ggttggttga 720 
tcagaacgga tttctggttt agttcaagat cgctgcgaga ttggccgctt ttcatgtgtt 780 
gtatatccct ttcgatcttt cctttggctg cctttacggt tgagaaattg gtacttcaga 840 
aatacatatc agaacctgtt gtcatctttc ttcatattat tatcaccatg acagaggttt 900 
tgtatccagt ttacgtcacc ctaaggtgtg attctgcttt tttatcaggt gtcactttga 960 
tgctcctcac ttgcattgtg tggctaaagt tggtttctta tgctcatact agctatgaca 1020 
taagatccct agccaatgca gctgataagg ccaatcctga agtctcctac tacgttagct 1080 
tgaagagctt ggcatatttc atggtcgctc ccacattgtg ttatcagcca agttatccac 1140 
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gttctgcatg tatacggaag ggttgggtgg ctcgtcaatt tgcaaaactg gtcatattca 1200 

ccggattcat gggatttata acagaacaat acataaatcc tatcgtcagg aactcaaagc 1260 

atcctttgaa aggcgatctt ctatatgcta ttgaaagagt gttgaagctt tcagttccaa 1320 

atttatatgt gtggctctgc atgttctact gcttcttcca cctttggtta aacatattgg 1380 

cagagcttct ctgcttcggg gatcgtgaat tctacaaaga ttzggtggaat gcaaaaagtg 1440 

tgggagatta ctggagaatg tggaatatgc ctgttcataa atggatggtt cgacatatat 1500 

acttcccgtg cttgcgcagc aagataccaa agacactcgc cattatcatt gctttcctag 1560 

tctctgcagt ctttcatgag ctacgcatcg cagttccttg tcgtctcttc aagctatggg 1620 

cttttcttgg gattatgttt caggtgcctt tggtcttcat cacaaactat ctacaggaaa 1680 

ggtttggctc aacggtgggg aacatgatct tctggttcat cttctgcatt ttcggacaac 1740 

cgatgtgtgt gcttctttat taccacgacc cgatgaaccg aaaaggatcg atgtcatgaa 1800 

acaactgttc aaaaaatgac tttcttcaaa catctatggc ctcgttggat ctccgttgat 1860 

gttgtggtgg ttctgatgct aaaacgacaa atagtgttat aaccattgaa gaagaaaaga 1920 

caattagagt tgttgtatcg ca 1942 

<210> 2 
<211> 520 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Ala lie Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly 
15 10 15 

Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg 
20 25 30 

Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser 
35 40 45 

Pro Ser Asp Asp Val Gly Ala Pro Ala Asp- Val Arg Asp Arg lie Asp 
50 55 60 

Ser Val Val Asn Asp Asp Ala Gin Gly Thr Ala Asn Leu Ala Gly Asp 
65 70 75 80 , 

Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu 
85 90 95 

Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro 
100 105 110 

Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala lie Phe 
115 120 125 

Lys Gin Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu lie 
130 135 140 
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Ala Val Asn Ser Arg Leu He He Glu Asn Leu Met Lys Tyr Gly Trp 
145 150 155 160 

Leu He Arg Thr Asp Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp 
165 170 175 

Pro Leu Phe Met Cys Cys He Ser Leu Ser He Phe Pro Leu Ala Ala 
180 185 190 

Phe Thr Val Glu Lys Leu Val Leu Gin Lys Tyr He Ser Glu Pro Val 
195 200 205 

Val He Phe Leu His He He He Thr Met Thr Glu Val Leu Tyr Pro 
210 215 220 

Val Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr 
225 230 235 240 

Leu Met Leu Leu Thr Cys He Val Trp Leu Lys Leu Val Ser Tyr Ala 
245 250 255 

His Thr Ser Tyr Asp He Arg Ser Leu Ala Asn Ala Ala Asp Lys Ala 
260 265 270 

Asn Pro Glu Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe 
275 280 285 

Met Val Ala Pro Thr Leu Cys Tyr Gin Pro Ser Tyr Pro Arg Ser Ala 
290 295 300 

Cys He Arg Lys Gly Trp Val Ala Arg Gin Phe Ala Lys Leu Val He 
305 * 310 315 320 

Phe Thr Gly Phe Met Gly Phe He He Glu Gin Tyr He Asn Pro lie 
325 330 335 

Val Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala lie 
340 345 350 

Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys 
355 360 365 

Met Phe Tyr Cys Phe Phe His Leu Trp Leu Asn He Leu Ala Glu Leu 
370 375 380 

Leu Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys 
385 390 395 400 
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- Ser Val Gly Asp 



Met Val Arg His 
420 

Thr Leu Ala lie 
435 

Leu Cys lie Ala 
450 

Gly lie Met Phe 
465 

Glu Arg Phe Gly 



Cys lie Phe Gly 
500 

Met Asn Arg Lys 
515 



Tyr Trp Arg Met 
405 

lie Tyr Phe Pro 



He He Ala Phe 
440 

Val Pro Cys Arg 
455 

Gin Val Pro Leu 
470 

Ser Thr Val Gly 
485 

Gin Pro Met Cys 



Gly Ser Met Ser 
520 



Trp Asn Met Pro 
410 

Cys Leu Arg Ser 
425 

Leu Val Ser Ala 



Leu Phe Lys Leu 
460 

Val Phe He Thr 
475 

Asn Met He Phe 
490 

Val Leu Leu Tyr 
505 



Val His Lys Trp 
415 

Lys He Pro Lys 
430 

Val Phe His Glu 
445 

Trp Ala Phe Leu 



Asn Tyr Leu Gin 
480 

Trp Phe lie Phe 
495 

Tyr His Asp Leu 
510 



<210> 3 

<211> 234 

<212> DNA 

<213> Glycine max 

<400> 3 

gtaagcttca agagcttagc atanttcctg gttgccccta ncattatgtt accagccaan 60 

ctatcctcgc acaccttata ttcgaaaggg ttggctgttt cgccaacttg tcaactgata 120 

atatttacag gagttatggg atttataata gaacaataca ttaatcccat tgtacaaaat 180 

tcacagcatc ctctcaaggg aaaccttctt tacgccatcg agagagttct gaag 234 

<210> 4 

<211> 267 

<212> DNA 

<213> Glycine max 

<400> 4 

ctgcttttgt atctggtgtc acgttgatgc tattaacttg cattgtgtgg ttaaaattgg 60 

tgtcatatgc acatacaaac tatgatatga gagcacttac tgtttcgaat gaaaagggag 120 

aaacattacc caatactttg atatggagta tccgtacact gtgaccttca ggagtttggc 180 

atacttcatg gttgctccta cattatgcta tcagacaagc tatcctcgca caccttcagt 240 

tcgaaagggt tgggtgtttc gtcaact 267 
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<210> 5 

<211> 275 

<212> DNA 

<213> Glycine max 

<400> 5 

gtggaatgcc aaaactgttg aagattattg 
gatgatccgc cacctatatt ttccatgttt 
tttaattgcc ttcctggttc tgctttattc 
catattcaag tngtgggttt cngnggaatt 
attnntnggc naaaaaattc cnngaacccc 



gaggatgtgg aatatgcctg ttcacaaatg 60 
aaggcacggt ataccaaagg ccgttgctcc 120 
catgagctgt gcatcgctgt tccttgccca 180 
nagtttcagg tnccttgggt ttcnaccnna 240 

ggggg 275 



<210> 6 

<211> 257 

<212> DNA 

<213> Glycine max 



<400> 6 

aacggaattg agactccaga gaatatgcca 
ggcttttgga aaaactggca tgcttccttc 
cctcttgggg gatctaagaa aaagctacta 
atctggcatg atttagagtg gaagcttctt 
atccctgagt tggtttt 



aaatgtatta ataattgtca caacttggaa 60 
aacaagtggc ttgtgaggta tatatacatt 120 
aatgtgtggg ttgttttcac atttgttgca 180 
tcatgggcat ggttgacgtg tttattcttc 240 

257 



<210> 7 
<211> 253 
<212> DNA 
<213> Zea mays 

<400> 7 

agaaaatgga acatgcctgt gcataaatgg attgttcgtc atatatattt tcct tgcang 60 

cgaaatggta tatcaaagga agttgctgtt tttatatcgt tcttgt t tct gctgtacttc 120 

atgagttatg rgttgctgtt ccctgccaca tactcaagtt ctgggctttt tttaggaauc 180 

atgcttcaga ttcccctcat catattgaca tcatacctca aaaataaact cagtgacaca 240 

atggttggca ata 253 



<210> 8 
<211> 254 
<212> DNA 
<213> Zea mays 

<400> 8 

tgaagtatgg cttattaata agatctggct tttggtttaa tgctacacca ttgcgagact 60 
ggccactgct aatgtgttgc cttagtctac ccatatttcc ccttggtgca tttgcagtcg 120 
aaaagttggc attcaacaat ctcattagtg atcctgctac tacctgtttt cacatccttt 180 
ttacaacatt tgaaattgta tatccagtgc tcgtgattct taagtgtgat tctgcagttt 240 
tatcaggctt tgtg 254 
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- <210> 9 

<211> 262 

<212> DNA 

<213> Zea mays 

<400> 9 

gaagtatggc ttatcaacaa gatccggcCt ctggtttaat gctacaccat tgcgagactg 60 

gccactgcta atgtgttgcc ttagtctacc catatttccc cttggtgcat ttgcagtcga 120 

aaagttggca ttcaacaatc tcattagtga tcctgctact acctgttttc acatcctttt 180 

tacaacattt gaaattgtat atccagtgct cgtgattctt aagtgtgatt ctgcagtttt 240 

acaggctttg tgttgatgtc ta 262 

<210> 10 

<211> 325 

<212> DNA 

<213> Zea mays 

<400> 10 

taatcnaacc tcgntncngg ctcagctgta cnccatgaga tatgtaacgc ggcgccgtgc 60 

cacatantca natctnggca tnncngggat catngttcag acaccgncgg nattcttgac 120 

aagatatctc catgctacgt tcaagcacgc aatggtgggc aacatgatan tttggntctn 180 

cagtatagtc ggacagccga tgtnnnnnna tctatactac catgacgtca tgaacaggca 240 

ggcccaggca agtagatagt ncggcagaga catgtacttc aacatcganc atcagnagca 300 

nacngagcga gcggcangaa ncagc 325 

<210> 11 
<211> 519 
<212> DNA 

<213> Mortierella alpina 

<400> 11 ... 

gagnnnngna acgtttagcc tnccgtagcc gccaaaatcc aagggncnac cnaccctncg 60 

ttanactnaa ctngaaaacn cnnncccaac ttnaggnact tnnagncccc ccnacttgac 120 

aacggagcac tatatttacc ccgtggtngt tcaacccagc cacctcaccc ttgcgagcat 180 

tggtgctgcc cttgataccc ttcacgct ta actatctcat gatcttttac atcattttcg 240 

agtgcatctg caacgccttt gcggaactaa gttgctttgc ggatcgcaac ttttacgagg 300 

attggtggaa ctgcgtcagc tttgatgagt gggcacgcaa atggaacaag cctgtgcaac 360 

acttcttgct ccgccacgtg tacgactcga gcatccgagt ccttccactt gtccgaaatc 420 

caatgccgcn aattgcaaac gttccttccc ggtcgtcaat gcgttcaacg aacctgggtg 480 

aagaatgggt ggtgacaacg ttaaagtgcg cccggtatc 519 

<210> 12 
<211> 518 
<212> DNA 
<213> murine 

<400> 12 

tggaggacaa cgcggggtct gatacgactc actataggga atttggccct cgagcagtag 60 
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attcggcacg atgggcacga 
gtcaacctgt ggtgccgcca 
gtcagtgggg ctgctgcgag 
tcgattactt catctttgct 
gaatacgaga gcgctttctg 
aagtggggct gatccaacag 
caagagcttc tgcagttttg 
tgtcaccgac ttttggcaga 



ggactccatc atgttcctca 
gcgaagggtc aaggccaaag 
caagctgtga gctatccaga 
cctactttgt gttatgaact 
ctacgacgag ttcttgagat 
tggatggtcc ctactatcca 
gagaccgcga gttctacaga 
actggaatat ccccgtgg 



agctttattc ccaccgggat 120 
ctgtctctac agggaagaag 180 
caacctgacc taccgagatc 240 
caactttcct cggtcccccc 300 
gctcttttct acccagcttc 360 
gaactccatg gaagcccttt 420 
gattggtgga atgctgagtc 480 

518 



<210> 13 
<211> 299 
<212> DNA 
<213> murine 



<400> 13 

ccatgatggc tcaggtccca ctggcctgga 
gcaatgcagc tgtgtgggtg acactcatca 
ccacgactac tacgtgctca actacgatgc 
agccctccct aacctgggcc tggagttctg 
ctagtctggg aggcctctct gcccctatgc 



ttgtgggccg attcttccaa gggaactatg 60 

ttgggcaacc ggtggctgt'c tcatgtatgt 120 

cccagtgggt catgagctac tgccaaaggc 180 

gaggggttcc tggctgcctg cacactcctc 240 

gctactcctg ctcttgggga tggcatttg 299 



<210> 14 
<211> 1895 
<212> DNA 
<213> Human 



<400> 14 ^ . n 

atctggtgtg atggggacag ggagggac.c ccccttaccc agcactggtg ttggctgagg ,0 
ngggtgctga gtctcagagc ttggcatgga gaccagacag ggctgggtct gcaagcctga 120 
ggctaccgcc ctgagctcgg gctgggacgt gcccagaggt gttgggagga tctggggtga x80 
gtac^ctgtg gccaggacta aaggggctnc accctcccgt ccatccctcg cagaccttga 240 
gcaatgcccg gttatttctg gagaacctca tcaagtatgg catcctggtg gaccccaccc 300 
aggtggtttc tctgttcctg aaggacccc. atagctggcc cgccccatgc ctggttattg 360 
cggccaatgt ctttgctgtg gctgcattcc aggntgagaa gcgccnggcg gtgggtgccc 420 
tgacggagca ggcgggactg ctgctgcacg tggccaacct ggccaccatc ctgtgtttcc 480 
cagcggctgt ggtcttactg gttgagtcta tcactccagt gggctccctg ctggcgctga 540 
tggcgcacac catcctcttc ctcaagctct tctcctaccg cgacgtcaac tcatggtgcc 600 
gcagggccag ggccaaggct gcctctgcag ggaagaaggc cagcagtgct gctgccccgc 660 
acaccgtgag ctacccggac aatctgacct accgcgatct ctactacttc ctcttcgccc 720 
ccaccttgtg ctacgagctc aactttcccc gctctccccg catccggaag cgctttctgc 780 
tgcgacggat ccttgagatg ctgttcttca cccagcccca ggtggggctg atccagcagt 840 
ggatggtccc caccatccag aactccatga agcccttcaa ggacatggac tactcacgca 900 
tcatcgagcg cctcctgaag ctggcggtcc ccaatcacct catctggctc atcttcttcc 960 
actggctctt ccactcctgc ctgaatgccg tggctgagct catgcagttt ggagaccggg 1020 
agttctaccg ggactggtgg aactccgagt ctgtcaccta cttctggcag aactggaaca 1080 
tccctgtgca caagtggtgc atcagacact tctacaagcc catgcttcga cggggcagca 1140 
gcaagtggat ggccaggaca ggggtgttcc tggcctcggc cttcttccac gagtacctgg 1200 
tgagcgtccc tctgcgaatg ttccgcctct gggcgttcac gggcatgatg gctcagatcc 1260 
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-cactggcctg gttcgtgggc cgctttttcc agggcaacta tggcaacgca gctgtgtggc 1320 

tgtcgctcat catcggacag ccaatagccg tcctcatgta cgtccacgac tactacgtgc 1380 

tcaactatga ggccccagcg gcagaggcct gagctgcacc tgagggcccg gcttctcact 1440 

gccacctcac acccgctgcc agagcccacc tctcctccta ggcctcgagt gctggggatg 1500 

ggcctggctg cacagcatcc tcctctggtc ccagggaggc ctctctgccc ctatggggct 1560 

ctgtcctgca cccctcaggg atggcgacag caggccagac acagtctgat gccagctggg 1620 

agtcttgctg accctgcccc gggtccgagg gtgtcaataa agtgctgccc agtgacctct 1680 

tcagcctgcc aggggcctgg ggcctggtgg ggggtatggc cacacccaca agggcgagtg 1740 

ccagagctgt gtggacagct gtcccaggac ctgccgggga gcagcagccc cactgcagca 1800 

gggcgggcat ggccggtagg gggagtgcaa ggccaggcag acgcccccat tccccacact 1860 

cccctaccta gaaaagctca gctcaggcgt cctct 1895 

<210> 15 
<211> 1766 
<212> DNA 
<213> Rattus sp. 

<400> 15 

cacgactggg ccgcgacgtg gcgcgggccg aagccatggg cgaccgcgga ggcgcgggaa 60 
gctctcggcg tcggaggacc ggctcgcggg tttccatcca gggtggtagt gggcccatgg 120 
tagacgaaga ggaggtgcga ■ gacgccgc tg tgggccccga cttgggcgcc gggggtgacg 180 
ctccggctcc ggctccggtt ccggctccag cccacacccg ggacaaagac cggcagacca 240 
gcgtgggcga cggccactgg gagctgaggt gccatcgtct gcaagactct ttgttcagct 300 
cagacagcgg tttcagcaat taccgtggta tcctgaattg gtgcgtggtg atgctgatcc 360 
tgagtaatgc aaggttattt ttagagaatc ttatcaagta tggcatcctg gtggatccca 420 
tccaggtggt gtctctgttt ctgaaggacc cctacagctg gcctgcccca tgcttgatca 480 
ttgcatccaa tatctttatt gtggctacat ttcagattga gaagcgcctg tcagtgggtg 540 
ccctgacaga gcagatgggg ctgctgctac atgcggttaa cccggccaca attatctgct 600 
tcccagcagc tgtggcctta ctggttgagt ctatcactcc agtgggttcc ctgtttgctc 660 
tggcatcata ctccatcatc ttcctcaagc ttttctccta ccgggatgtc aatctgtggt 720 
gccgccagcg aagggtcaag gccaaagctg tgtctgcagg gaagaaggcc agtggggctg 780 
ctgcccagaa cactgtaagc tatccggaca acctgaccta ccgagatctc tattactcca 840 
tctttgctcc tactttgtgt tatgaactca actttcctcg atccccccga atacgaaagc 900 
gctttctgct acggcgggtt cttgagatgc tctttttcac ccagcttcaa gtggggctga 960 
tccagcagtg gatggtccct actatccaga actccatgaa gcccttcaag gacatggact 1020 
attcacgaat cattgagcgt ctcttaaagc tggcggtccc caaccatctg atatggctca 1080 
tcttcttcta ttggcttttc cactcatgtc tcaatgctgt ggcagagctc ctgcagtttg 1140 
gagaccgcga gttctacagg gactggtgga atgctgagtc tgtcacctac ttttggcaga 1200 
actggaatat ccccgtgcac aagtggtgca tcagacactt ctacaagcct atgctcagac 1260 
tgggcagcaa caaatggatg gccaggactg gggtcttttt ggcgtcagcc ttcttccatg 1320 
agtacctagt gagcattccc ctgaggatgt tccgcctctg ggcattcaca gccatgatgg 1380 
ctcaggtccc actggcccgg attgtgaa.cc gcttcttcca agggaactat ggcaatgcag 1440 
ctgtgtgggt gacactcatc attgggcaac cggtggctgt gctcatgtat gtccacgact 1500 
actacgtgct caactatgat gccccagtgg gggcctgagc tactgccaaa ggccagccct 1560 
ccctaacctg ggcctggagt tctggagggc ttcctggctg cctgcacact cctcctagtc 1620 
tgggaggcct ctctgcccct atggggccta ctcctgctct tggggatggc acctgagtcc 1680 
agctggtatg agccagtgct gggagtctgt gctgaccagg ggctgaggat atcaataaag 1740 
agctatctaa aaaaaaaaaa aaaaaa 1766 

8 



BNSDOCID: <WO 9963096A2 I > 



1 



WO 99/63096 



PCT/US99/12541 



<210> 16 
<211> 1766 
<212> DNA 
<213> Rattus sp. 

<400> 16 

cacgactggg ccgcgacgtg gtgcgggccg aagccatggg cgaccgcgga ggcgcgggaa 60 
gctctcggcg tcggaggacc ggctcgcggg tttccatcca gggtggtagt gggcccatgg 12 0 
tagacgaaga ggaggtgcga gacgccgctg tgggccccga cttgggcgcc gggggtgacg 180 
ctccggctcc ggctccggtt ccggctccag cccacacccg ggacaaagac cggcagacca 240 
gcgtgggcga cggccactgg gagctgaggt gccatcgtct gcaagactct ttgttcagct 3 00 
cagacagcgg tttcagcaat taccgtggta tcctgaattg gtgcgtggtg atgctgatcc 3 60 
tgagtaatgc aaggttattt ttagagaatc ttatcaagta tggcatcctg gtggatccca 420 
tccaggtggt gtctctgttt ctgaaggacc cctacagctg gcctgcccca tgcttgatca 480 
ttgcatccaa tatctttatt gtggctacat ttcagattga gaagcgcctg tcagtgggtg 540 
ccctgacaga gcagatgggg ctgctgctac atgtggttaa cctggccaca attatctgct 600 
tcccagcagc tgtggcctta ctggttgagt ctatcactcc agtgggttcc ctgtttgctc 660 
tggcatcata ctccatcatc ttcctcaagc ttttctccta ccgggatgtc aatctgtggt 720 
gccgccagcg aagggtcaag gccaaagctg tgtctgcagg gaagaaggtc agtggggctg 780 
ctgcccagaa cactgtaagc tatccggaca acctgaccta ccgagatctc tattacttca 840 
tctttgctcc tactttgtgt tatgaactca actttcctcg atccccccga atacgaaagc 900 
gctttctgct acggcgggtt cttgagatgc tctttttcac ccagcttcaa gtggggctga 960 
tccagcagtg gatggtccct actatccaga actccatgaa gcccttcaag gacatggact 1020 
attcacgaat cattgagcgt ctcttaaagc tggcggtccc caaccatctg atatggctca 1080 
tcttcttcta ttggcttttc cactcatgtc tcaatgctgt ggcagagctc ctgcagtttg 1140 
gagaccgcga gttctacagg gactggtgga atgctgagtc tgtcacctac ttttggcaga 1200 
actggaatat ccccgtgcac aagtggtgca tcagacactt ctacaagcct atgctcagac 1260 
tgggcagcaa caaatggatg gccaggactg gggtcttttt ggcgtcagcc ttcttccatg 1320 
agtacctagt gagcattccc ctgaggatgt tccgcctctg ggcattcaca gccatgatgg 1380 
ctcaggtccc actggcctgg attgtgaacc gcttcttcca agggaactat ggcaatgcag 1440 
ctgtgtgggt gacactcatc attgggcaac cggtggctgt gctcatgtat gtccacgact 1500 
actacgtgct caactatgat gccccagtgg gggcctgagc tactgccaaa ggccagccct 1560 
ccctaacctg ggcctggagt tctggagggc ttcctggctg cctgcacact cctcctagtc 1620 
tgggaggcct ctctgcccct atggggccta ctcctgctct tggggatggc acctgagtcc 1680 
agctggtatg agccagtgct gggagtctgt gctgaccagg ggctgaggat atcaataaag 1740 
agctatctaa aaaaaaaaaa aaaaaa 1766 

<210> 17 
<211> 500 
<212> PRT 
<213> Rattus sp. 

<400> 17 

Met Gly Asp Arg Gly Gly Ala Gly Ser Ser Arg Arg Arg Arg Thr Gly 
15 10 15 

Ser Arg Val Ser lie Gin Gly Gly Ser Gly Pro Met Val Asp Glu Glu 



9 



WO 99/63096 PCI7US99/12541 

20 25 " 30 

Glu Val Arg Asp Ala Ala Val Gly Pro Asp Leu Gly Ala Gly Gly Asp 
35 40 45 

Ala Pro Ala Pro Ala Pro Val Pro Ala Pro Ala His Thr Arg Asp Lys 
50 55 60 

Asp Arg Gin Thr Ser Val Gly Asp Gly His Trp Glu Leu Arg Cys His 
65 70 75 80 

Arg Leu Gin Asp Ser Leu Phe Ser Ser Asp Ser Gly Phe Ser Asn Tyr 
85 90 95 

Arg Gly lie Leu Asn Trp Cys Val Val Met Leu lie Leu Ser Asn Ala 
100 105 110 

Arg Leu Phe Leu Glu Asn Leu lie Lys Tyr Gly lie Leu Val Asp Pro 
115 120 125 

He Gin Val Val Ser Leu Phe Leu Lys Asp Pro Tyr Ser Trp Pro Ala 
130 135 • 140 

Pro Cys Leu He He Ala Ser Asn lie Phe He Val Ala Thr Phe Gin 
145 150 155 160 

He Glu Lys Arg Leu Ser Val Gly Ala Leu Thr Glu Gin Met Gly Leu 
165 170 175 

Leu Leu His Val Val Asn Leu Ala Thr lie He Cys Phe Pro Ala Ala 
180 185 190 

Val Ala Leu Leu Val Glu Ser He Thr Pro Val Gly Ser Leu Phe Ala 
195 200 205 

Leu Ala Ser Tyr Ser lie He Phe Leu Lys Leu Phe Ser Tyr Arg Asp 
210 215 220 

Val Asn Leu Trp Cys Arg Gin Arg Arg Val Lys Ala Lys Ala Val Ser 
225 230 235 240 

Ala Gly Lys Lys Val Ser Gly Ala Ala Ala Gin Asn Thr Val Ser Tyr 
245 250 255 

Pro Asp Asn Leu Thr Tyr Arg Asp Leu Tyr Tyr Phe lie Phe Ala Pro 
260 265 270 

Thr Leu Cys Tyr Glu Leu Asn Phe Pro Arg Ser Pro Arg lie Arg Lys 
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275 280 285 

Arg Phe Leu Leu Arg Arg Val Leu Glu Met Leu Phe Phe Thr Gin Leu 
290 295 300 

Gin Val Gly Leu He Gin Gin Trp Met Val Pro Thr He Gin Asn Ser 
305 310 315 320 

Met Lys Pro Phe Lys Asp Met Asp Tyr Ser Arg He He Glu Arg Leu 
325 330 335 

Leu Lys Leu Ala Val Pro Asn His Leu He Trp Leu He Phe Phe Tyr 
340 345 350 

Trp Leu Phe His Ser Cys Leu Asn Ala Val Ala Glu Leu Leu Gin Phe 
355 360 365 

Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ala Glu Ser Val Thr 
370 375 380 

Tyr Phe Trp Gin Asn Trp Asn He Pro Val His Lys Trp Cys He Arg 
385 390 395 400 

His Phe Tyr Lys Pro Met Leu Arg Leu Gly Ser Asn Lys Trp Met Ala 
405 410 415 

Arg Thr Gly Val Phe Leu Ala Ser Ala Phe Phe His Glu Tyr Leu Val 
420 425 430 

Ser He Pro Leu Arg Met Phe Arg Leu Trp Ala Phe Thr Ala Met Met 
. 435 440 445 

Ala Gin Val Pro Leu Ala Trp He Val Asn Arg Phe Phe Gin Gly Asn 
450 455 460 

Tyr Gly Asn Ala Ala Val Trp Val Thr Leu He He Gly Gin Pro Val 
465 470 475 480 

Ala Val Leu Met Tyr Val His Asp Tyr Tyr Val Leu Asn Tyr Asp Ala 
485 490 495 

Pro Val Gly Ala 
500 



<210> 18 
<211> 496 
<212> PRT 
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<213> Caenorhabditis elegans 
<400> 18 

Met Arg Gin Gin Thr Gly Arg Arg Arg Arg Gin Pro Ser Glu Thr Ser 
15 10 15 

Asn Gly Ser Leu Ala Ser Ser Arg Arg Ser Ser Phe Ala Gin Asn Gly 
20 25 30 

Asn Ser Ser Arg Lys Ser Ser Glu Met Arg Gly Pro Cys Glu Lys Val 
35 40 45 

Val His Thr Ala Gin Asp Ser Leu Phe Ser Thr Ser Ser Gly Trp Thr 
50 55 60 

Asn Phe Arg Gly Phe Phe Asn Leu Ser lie Leu Leu Leu Val Leu Ser 
65 v 70 75 80 

Asn Gly Arg Val Ala Leu Glu Asn Val lie Lys Tyr Gly lie Leu lie 
85 90 95 

Thr Pro Leu Gin Trp He Ser Thr Phe Val Glu His His Tyr Ser He 
100 105 HO 

Trp Ser Trp Pro Asn Leu Ala Leu He Leu Cys Ser Asn He Gin He 
115 120 125 

Leu Ser Val Phe Gly Met Glu Lys He Leu Glu Arg Gly Trp Leu Gly 
130 135 140 

Asn Gly Phe Ala Ala Val Phe Tyr Thr Ser Leu Val lie Ala His Leu 
145 150 155 160 

Thr He Pro Val Val Val Thr Leu Thr His Lys Trp Lys Asn Pro Leu 
165 170 175 

Trp Ser Val Val Met Met Gly Val Tyr Val He Glu Ala Leu Lys Phe 
180 185 190 

He Ser Tyr Gly His Val Asn Tyr Trp Ala Arg Asp Ala Arg Arg Lys 
195 200 205 

He Thr Glu Leu Lys Thr Gin Val Thr Asp Leu Ala Lys Lys Thr Cys 
210 215 220 

Asp Pro Lys Gin Phe Trp Asp Leu Lys Asp Glu Leu Ser Met His Gin 
225 230 235 240 
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Met Ala Ala Gin Tyr Pro Ala Asn Leu Thr Leu Ser Asn lie Tyr Tyr 
245 250 255 

Phe Met Ala Ala Pro Thr Leu Cys Tyr Glu Phe Lys Phe Pro Arg Leu 
260 265 270 

Leu Arg lie Arg Lys His Phe Leu lie Lys Arg Thr Val Glu Leu He 
275 280 285 

Phe Leu Ser Phe Leu He Ala Ala Leu Val Gin Gin Trp Val Val Pro 
290 295 300 

Thr Val Arg Asn Ser Met Lys Pro Leu Ser Glu Met Glu Tyr Ser Arg 
305 310 315 320 

Cys Leu Glu Arg Leu Leu Lys Leu Ala He Pro Asn His Leu He Trp 
325 330 335 

Leu Leu Phe Phe Tyr Thr Phe Phe His Ser Phe Leu Asn Leu He Ala 
340 345 350 

Glu Leu Leu Arg Phe Ala Asp Arg Glu Phe Tyr Arg Asp Phe Trp Asn 
355 360 365 

Ala Glu Thr He Gly Tyr Phe Trp Lys Ser Trp Asn He Pro Val His 
370 375 380 

Arg Phe Ala Val Arg His He Tyr Ser Pro Met Met Arg Asn Asn Phe 
385 390 395 400 

Ser Lys Met Ser Ala Phe Phe Val Val Phe Phe Val Ser Ala Phe Phe 
405 410 415 

His Glu Tyr Leu Val Ser Val Pro Leu Lys He Phe Arg Leu Trp Ser 
420 425 430 

Tyr Tyr Gly Met Met Gly Gin He Pro Leu Ser He He Thr Asp Lys 
435 440 445 

Val Val Arg Gly Gly Arg Thr Gly Asn He He Val Trp Leu Ser Leu 
450 455 460 

He Val Gly Gin Pro Leu Ala He Leu Met Tyr Gly His Asp Trp Tyr 
465 470 475 480 

He Leu Asn Phe Gly Val Ser Ala Val Gin Asn Gin Thr Val Gly He 
485 490 495 
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<210> 19 
<211> 300 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 19 

tttgatatgt acggtaaacg gaaaaaaggt attcatgtat ggcaaggtgg taataaatgg 60 
cactaaatat gtttcaaaag tgtgagcaaa cgtatgtgag agacgagaaa aataagaaaa 120 
cgacctgtaa cacatgaaaa atatcaatag gaattttgag ataatttggc aacacgcaat 180 
ataatgatta taataaaaaa cttgtcttaa gactagagaa ctgctaattc aaaaaaaaca 240 
aattgagata aatcaaatac caacggtttg gttttgaact gccgaaacac caaagttcaa 300 

<210> 20 
<211> 381 
<212> DNA 

<213> Caenorhabditis elegans 
<400> 20 

tgcaaatgcg ccaacaaacg ggacgacggc ggcgtcagcc ttcggnaaac atctaatggt 60 
tctttggctt ccagtagacg ctcctcattt gcacaaaatg gtaattcgcc aagggaaaag 120 
ttcagaaatg agaggacctt gcgagaaagt ggtacatact gctcaagatt cattgttttc 180 
gacgagttct ggatggacaa atttccgtgg attcttcaat ttgtctattt tacttttggt 240 
actttcaaat ggacgcgtgg cacttgaaaa tgtgatcaaa tatggtattt tgataacacc 300 
ccttcagtgg acctcaacgt ttgttgagca ncactactca acttggagct ggccaaatcc 360 
tgctctcatc ceatgctcaa a 381 

<210> 21 
<211> 6 
<212> PRT 

<213> Conserved Peptide Sequence 
<400> 21 

Met Lys Ala His Ser Phe 
1 5 



<210> 22 

<211> 250 

<212> DNA 

<213> Human 

<300> 

<301> Chang, 

<303> J. Biol. Chem. 
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<304> 268 

<306> 20747-20755 

<307> 1993 



<400> 22 

rrsdvdhrty hmastwdyd grvssyagkt wwtwwmsts vyhwrcgysk sshrshgmgv 60 
ggtywayta srrvmkahsv rnvrvnsakk sstvtvnyya tyrdsyrntv rwgyvamkav 120 
gcyvyyrcar nksarwcvn sgvtahcwna amrgdrmykd wwnscsysny yrtwnwvhd 180 
wvyyaykdws krksaamava vsawhyaav csywmgman vndsrkkwnv mwtsgngvcy 240 

* . 250 
swyarrhckn 



<210> 23 
<211> 409 
<212> PRT 
<213> murine 



<300> 

<301> Uelman , 

<303> J. Biol. Chem. 

<304> 270 

<306> 26192-26201 

<307> 1995 



<400> 23 
Arg Gin Ser 
1 

lie Tyr His 



lie Val Val 
35 

Leu Leu Ala 
50 

Trp Ala Met 
65 

Pro Trp Ala 



Leu Val His 



Phe Val Pro 
115 



Leu Leu Asp 
5 

Met Phe lie 
20 

Asp Tyr lie 



Tyr Ala Phe 



Phe Leu Ser 
70 

His Gly Tyr 
85 

Gly Leu Leu 
100 

Thr Tyr Val 



Glu Leu Phe 



Ala Leu Leu 
25 

Asp Glu Gly 
40 

Gly Lys Phe 
55 

Thr Leu Ser 



Ser Lys Ser 



Phe Leu Val 
105 

Val Leu Ala 
120 



Glu Val Asp His 
10 

lie Leu Phe Val 



■Arg^. Leu Val Leu 
45 

Pro Thr Val lie 
60 

lie Pro Tyr Phe 
75 

Ser His Pro Leu 
90 

Phe Gin Leu Gly 



Tyr Thr Leu Pro 
125 



lie Arg Thr 
15 

Leu Ser Thr 
30 

Glu Phe Asn 



Trp Thr Trp 



Leu Phe Gin 
80 

lie Tyr Ser 
95 

Val Leu Gly 
110 

Pro Ala Ser 
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Arg Phe lie Leu He Leu Glu Gin He Arg Leu He Met Lys Ala His 
130 . 135 140 

Ser Phe Val Arg Glu Asn He Pro Arg Val Leu Asn Ala Ala Lys Glu 
145 150 155 160 

Lys Ser Ser Lys Asp Pro Leu Pro Thr Val Asn Gin Tyr Leu Tyr Phe 
165 170 175 

Leu Phe Ala Pro Thr Leu He Tyr Arg Asp Asn Tyr Pro Arg Thr Pro 
180 185 190 

Thr Val Arg Trp Gly Tyr Val Ala Met Gin Phe Leu Gin Val Phe Gly 
195 200 205 

Cys Leu Phe Tyr Val Tyr Tyr He Phe Glu Arg Leu Cys Ala Pro Leu 
210 215 220 



Leu 



Phe Arg Asn He Lys Gin Glu Pro Phe Ser Ala Arg Val Leu Val 
225 230 235 2 4o 

Cys Val Phe Asn Ser He Leu Pro Gly Val Leu He Leu Phe Leu Ser 
245 250 255 

Phe Phe Ala Phe Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu 
260 265 270 

Arg Phe Gly Asp Arg Met Phe Tyr . Lys Asp Trp Trp Asn Ser Thr Ser 
275 280 285 

Tyr Ser Asn Tyr Tyr Arg Thr Trp Asn Val Val Val His Asp Trp Leu 
290 295 300 

Tyr Tyr Tyr Val Tyr Lys Asp Leu Leu Trp Phe Phe Ser Lys Arg Phe 
305 31 ° 3 15 320 

Lys Ser Ala Ala Met Leu Ala Val Phe Ala Leu Ser Ala Val Val His 
325 330 335 

Glu Tyr Ala Leu Ala lie Cys Leu Ser Tyr Phe Tyr Pro Val Leu Phe 
340 345 35Q 

Val Leu Phe Met Phe Phe Gly Met Ala Phe Asn Phe He Val Asn Asp 
355 360 365 

Ser Arg Lys Arg Pro He Trp Asn He Met Val Trp Ala Ser Leu Phe 
370 375 380 
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Leu Gly Tyr Gly Leu lie Leu Cys Phe Tyr Ser Gin Glu Trp Tyr Ala 
385 390 395 400 

Arg Gin His Cys Pro Leu Lys Asn Pro 
405 



<210> 24 
<211> 429 
<212> PRT 
<213> yeast 

<300> 
<301> Yu, 

<303> J. Biol. Chem. 
<304> 271 
<306> 24157-24163 
<307> 1996 

<400> 2.4 

Asp Lys Ala Asp Ala Pro Pro Gly Glu Lys Leu Glu Ser Asn Phe Ser 
15 10 15 

Gly lie Tyr Val Phe Ala Trp Met Phe Leu Gly Trp lie Ala lie Arg 
20 25 30 

Cys Cys Thr Asp Tyr Tyr Ala Ser Tyr Gly Ser Ala Trp Asn Lys Leu 
35 40 45 

Glu lie Val Gin Tyr Met Thr Thr Asp Leu Phe Thr lie Ala Met Leu 
50 55 60 

Asp Leu Ala Met Phe Leu Cys Thr Phe Phe Val Val Phe Val His Trp 
65 70 75 80 

Leu Val Lys Lys Arg lie lie Asn Trp Lys Trp Thr Gly Phe Val Ala 
85 90 95 

Val Ser lie Phe Glu Leu Ala Phe lie Pro Val Thr Phe Pro lie Tyr 
100 105 110 

Val Tyr Tyr Phe Asp Phe Asn Trp Val Thr Arg lie Phe Leu Phe Leu 
115 120 125 

His Ser Val Val Phe Val Met Lys Ser His Ser Phe Ala Phe Tyr Asn 
130 135 140 



Gly Tyr Leu Trp Asp lie Lys Gin Glu Leu Glu Tyr Ser Ser Lys Gin 
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145 150 155 160 

Leu Gin Lys Tyr Lys Glu Ser Leu Ser Pro Glu Thr Arg Glu He Leu 
165 170 175 

Gin Lys Ser Cys Asp Phe Cys Leu Phe Glu Leu Asn Tyr Gin Thr Lys 
130 185 190 

Asp Asn Asp Phe Pro Asn Asn He Ser Cys Ser Asn Phe Phe Met Phe 
195 200 205 

Cys Leu Phe Pro Val Leu Val Tyr Gin He Asn Tyr Pro Arg Thr Ser 
210 215 220 

Arg He Arg Trp Arg Tyr Val Leu Glu Lys Val Cys Ala He He Gly 
225 230 235 240 

Thr He Phe Leu Met Met Val Thr Ala Gin Phe Phe Men His Pro Val 
245 250 255 

Ala Met Arg Cys He Gin Phe His Asn Thr Pro Thr Phe Gly Gly Trp 
260 265 270 

He Pro Ala Thr Gin Glu Trp Phe His Leu Leu Phe Asp Met He Pro 
275 280 285 

Gly Phe Thr Val Leu Tyr Met Leu Thr Phe Tyr Met He Trp Asp Ala 
290 295 300 

Leu Leu Asn Cys Val Ala Glu Leu Thr Arg Phe Ala Asp Arg Tyr Phe 
305 310 315 320 

Tyr Gly Asp Trp Trp Asn Cys Val Ser Phe Glu Glu Phe Ser Arg lie 
325 330 335 

Trp Asn Val Pro Val His Lys Phe Leu Leu Arg His Val Tyr His Ser 
340 345 350 

Ser Met Gly Ala Leu His Leu Ser Lys Ser Gin Ala Thr Leu Phe Thr 
355 360 365 

Phe Phe Leu Ser Ala Val Phe His Glu Met Ala Met Phe Ala He Phe 
370 375 380 

Arg Arg Val Arg Gly Tyr Leu Phe Met Phe Gin Leu Ser Gin Phe Val 
385 390 395 400 

Trp Thr Ala Leu Ser Asn Thr Lys Phe Leu Arg Ala Arg Pro Gin Leu 
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405 410 415 

Ser Asn Val Val Phe Ser Phe Gly Val Cys Ser Gly Pro 
420 425 



<210> .25 
<211> 432 
<212> PRT 
<213> yeast 

<300> 

<301> Yang, 
<303> Science 
<304> 272 
<306> 1353-1356 
<307> 1996 



<400> 25 

Glu Thr Val Val Thr Val Glu Thr Thr He He Ser Ser Asn Phe Ser 
1 5 10 15 

Gly Leu Tyr Val Ala Phe Trp Met Ala He Ala Phe Gly Ala Val Lys 
20 25 30 

Ala Leu He Asp Tyr Tyr Tyr Gin His Asn Gly Ser Phe Lys Asp Ser 
35 40 45 

Glu He Leu Lys Phe Met Thr Thr Asn Leu Phe Thr Val Ala Ser Val 
50 55 60 

Asp Leu Leu Met Tyr Leu Ser Thr Tyr Phe Val Val Gly He Gin Tyr 
65 70 75 80 

Leu Cys Lys Trp Gly Val Leu Lys Trp Gly Thr Thr Gly Trp He Phe 
85 90 95 

Thr Ser He Tyr Glu Phe Leu Phe Val He Phe Tyr Met Tyr Leu Thr 
100 105 HO 

Glu Asn He Leu Lys Leu His Trp Leu Ser Lys He Phe Leu Phe Leu 
115 120 125 

His Ser Leu Val Leu Leu Met Lys Met His Ser Phe Ala Phe Tyr Asn 
130 135 140 

Gly Tyr Leu Trp Gly He Lys Glu Glu Leu Gin Phe Ser Lys Ser Ala 
145 150 155 160 
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Leu Ala Lys Tyr 



Leu Glu Lys Ser 
180 

Leu Ser Asp Gin 
195 

Phe Phe Trp Phe 
210 

Pro Arg Thr Lys 
225 

Ala lie Phe Gly 



Met Tyr Pro Val 
260 

Thr Gly lie Leu 
275 

lie Val Pro Gly 
290 

Trp Asp Ala lie 
305 

Arg Tyr Phe Tyr 



Ser Arg He Trp 
340 

Tyr His Ser Ser 
355 

Leu Met Thr Phe 
370 

Val He Phe Lys 
385 

Gin Met Pro Leu 



Lys Asp Ser He 
165 

Cys Glu Phe Cys 



Thr Gin Lys Phe 
200 

Thr Met Phe Pro 
215 

Glu He Arg Trp 
230 

Thr He Phe Leu 
245 

Ala Met Arg Ala 



Asp Arg Leu Leu 
280 

Phe He Val Met 
295 

Leu Asn Cys Val 
310 

Gly Asp Trp Trp 
325 

Asn He Pro Val 



Met Ser Ser Phe 
360 

Phe Leu Ser Ser 
375 

Lys Leu Arg Phe 
390 

Val Ala Leu Thr 
405 



Asn Asp Pro Lys 
170 

Ser Phe Glu Leu 
185 

Pro Asn Asn He 



Thr Leu He Tyr 
220 

Ser Tyr Val Leu 
235 

Met Met He Asp 
250 

Leu Ala Val Arg 
265 

Lys Trp Val Gly 



Tyr He Leu Asp 
300 

Ala Glu Leu Thr 
315 

Asn Cys Val Ser 
330 

His Lys Phe Leu 
345 

Lys Leu Asn Lys 



Val Val His Glu 
380 

Tyr Leu Phe Phe 
395 

Asn Thr Lys Phe 
410 



Val He Gly Ala 
175 

Ser Ser Gin Ser 
190 

Ser Ala Lys Ser 
205 

Gin He Glu Tyr 



Glu Lys He Cys 
240 

Ala Gin He Leu 
255 

Asn Ser Glu Trp 
270 

Leu Leu Val Asp 
285 

Phe Tyr Leu He 



Arg Phe Gly Asp 
320 

Trp Ala Asp Phe 
335 

Leu Arg His Val 
350 

Ser Gin Ala Thr 
365 

Leu Ala Met ■ Tyr 



Phe Gin Met Leu 
400 

Met Arg Asn Arg 
415 
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Thr lie lie Gly Asn Val He Phe Trp Leu Gly He Cys Met Gly Pro 
420 425 430 



<210> 26 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Arabidopsis 
ACAT-Like primer for 5 ' RACE 

<400> 26 

tgcaaattga cgagcacacc aaccccttc 29 

<210> 27 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Arabidopsis 
ACAT-Like primer for 5' RACE 

<400> 27 

aaggatgctt tgagttcctg acaatagg 28 

<210> 28 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :Rat 5' PCR 
primer 

<400> 28 

taggtgacag actcagcatt ccaccagtcc c 31 

<210> 29 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Rat ACAT-like 
nested PCR primer 

<400> 29 

cgccagcttt aagagacgct caatgattcg 30 

<210> 30 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Rat ACAT-like 
PCR primer 

<400> 30 

ggatccctgc aggtcaggcc cccaccgggg catcata 37 

<210> 31 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Rat ACAT-like 
PCR primer- 3 ' 

<400> 31 

ggatccgcgg ccgcacaatg ggcgaccgcg gaggcgcggg a 41 

<210> 32 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :C. elegans 
ACAT-like PCR primer-5' 

<400> 32 

ggatccgcgg ccgcacaatg cgtcaacaaa cgggacgacg g 41 

<210> 33 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: C. elegans 
ACAT-like PCR primer- 3 ' 

<400> 33 

ggatcccctg caggtcaaat accaacggtt tggttttg 

<210> 34 

<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Arabidopsis 5' 
PCR primer 

<400> 34 

tcgacctgca ggaagcttag aaatggcgat tttggattc 

<210> 35 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Arabidopsis 
ACAT-like 3' PCR primer 

<400> 35 

ggatccgcgg ccgctcatga catcgatcct tttcgg 

<210> 36 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cloning 
Adapter 

<400> 36 

cgcgatttaa atggcgcgcc ctgcaggcgg ccgcctgcag ggcgcgccat ttaa 

<210> 37 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: pCGN8618 
Cloning Oligonucleotide 

<400> 37 

tcgaggatcc gcggccgcaa gcttcctgca gg 32 

<210> 38 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pCGN8618 
Cloning Oligonucleotide 

<400> 38 

tcgacctgca ggaagcttgc ggccgcggat cc 32 

<210> 39 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pCGN8619 
Cloning Oligonucleotide 

<400> 39 

tcgacctgca ggaagcttgc ggccgcggat cc _ . 3 2 

<210> 40 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pCGN8619 
Cloning Oligonucleotide 

<400> 40 

tcgaggatcc gcggccgcaa gcttcctgca gg 32 

<210> 41 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: pCGN8620 
Cloning Oligonucleotide 

<400> 41 

tcgaggatcc gcggccgcaa gcttcctgca ggagct 

<210> 42 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pCGN8620 
Cloning Oligonucleotide 

<400> 42 

cctgcaggaa gcttgcggcc gcggatcc 

<210> 43 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pCGN8621 
Cloning Oligonucleotide 

<400> 43 

tcgacctgca ggaagcttgc ggccgcggat ccagct 

<210> 44 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pCGN8621 
Cloning Oligonucleotide 

<400> 44 

ggatccgcgg ccgcaagctt cctgcagg 

<210> 45 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Cloning 
Adapter 

<400> 45 

caggaggcgg ccgcaggagg ctgcaggtac 3 0 

<210> 46 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cloning 
Adapter 

<400> 46 

cctgcagcct cctgcggccg cctcctgagc t 31 
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CTCTCGTOAATCCTTTTTCCT^^^ 
'rreC'l^ATAAGGAACCAGACACGAATCCCATT^ 

TCCGCTCTTTCCCTCTCCATTAGA'rrCTGTTTCCTCTTTCAATTTCr 

CGATTCTCTCTGAC^CCTCTTTTCTCCCGA^ 

CGATTTTG^TTCTGCTGGCGTTACTATO^ 

ATCTTGATAGGCTTCGTCGACGGAAATCGACATCGGATTCTTCTAACGGACTTCTTCTC^ 
CTGCTTCCGATAATAATTCTCCTTCGGATGATGTTGGAGCTCCCGCCGACGTTAGGGA'TC 
GGATTGATTCCGTTGTTAACGATGAeGCT^ 
ACGGTGGTCGttlATAATAACGGTGGTGGAAC^^ 

ATOCTACGTTTACGTATCGACCGTCGGT'TCCAGCTCATCGGAGGGCGAGAGAGAG'rCCAC 

TTAGCTCCGACGCAATCTTCAAACAGAGCCATGCCGGATTATTCAACCTCTGTG 

TTCTTATTGCTGTAAACAGTAGACTCATCAT^ 

TCAGAACCJC^TTTCTGGTTTAGTTCAAGAT^ 

GTATAl^CCCTTTCGAlH2TTTCCTTTGGCTGCCTTT^ 

AATACATATCAGAACCTGTTCTCATCTT^ 

TCTATCCAGTTTAC^TCACCCTAAGGTGTGAT^^ 

T<TCTCCUY^CT?GeATTGTGr^ 

taagatccctagccaatgcagctgataaggccaatcctgaagtntcctactacgttagct 
tgaagagcttg^caratv p l>catggtcgca , cccacattgtgttatcagccaagttatccac 
gttctccatgtatacggaag^gttck3gtggctc^ 

ccggattcatgggatttataatagaacaatatataaattctattgtcaggaagtcaaagc 

ATCCTTTGiAAAGGeGATL s T^ 
ATTTATATGTGTGGCTCTGCATGTTCaV^Ca'GC 

cagagcttctctgcttcggggatcgtgaattctagaaagattggtgga^ 
tgggagattactgcagaatgtgglaatatgc 

acttcccgtgcttgcgcagcaagatacclaaagacactcgccattatcat'l'gcl'ttcctag 
tctctgcagtctttc^tgagctatgcatcgcagttc 
cttttcttcggattatgtttcaggtgcctttc 
ggtttggctcaacggtcgggaacatgatcttctggtt^^ 

CGATGTGTGOXSCTTCTTTATTACCAnGArc 
ACAACTGTTCAAAAAATGACTT'ICT^ 
GTTGTGGTGGTTCTGATGCTAAAACGACAAAT^^ 
CAATTAGAGTTCTTGTATCGCA 

>prot-ein 

MAI LDSAGVTTVTEWGGGRFVDLDRLRkRXSR SOS SNGLLLSG5DKIN S PSDDVG APADVK 
DRI DOTVNDDAQGT ANJjAGUNWGGGDWNGGGRGGG EGRGNADATFT YRPS VFAHRRAKES 
PLpSGDA IFKQS3 3 AGLFNLCWVL I A VNSRLi 1 1 ENLMK YGWL IRTDFWF SERSLRDWPLFM 
CC I SLS IFPLAAPTVRKI . VLQK Y 1 SliFWXFLHI 1 1 TMTEVLYP WVTLRCDSAl^LSG VT 
LMLLTC I VWLKL Y AHTS YDI R £3 L ANAADKftNPEV S Y Y VSL^SLAYFMVAPTLCYQ PSY 
PRSAC1 RKG WVARQFAK1. VI FTGFMG Tl I EQY IN PI VKN S KK PLKGDI..LY AI ERVLKDSV 
PNLYVWIXMFYCFFi JLWLNI LAEiLLCFGDREF YKDWWNAKS VGD YWRKWMMPVKKWMVRH 
I YFPCLRSK I PKTLAI 1 1 AFLVS AVFHlii-C I AVFCRLFKLW AFLG INPOVPLVF I TJMYIjQ 
ERFG ST VGNMT FWFIFC I FCQFMCVLL Y YHDLMMRKG SMS 

Figure 1 
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GAGNl^NGN^AAnGTTTAGCCTNCCGTAGCCCCCAA^'lCCAaGGGWCNACCNACCCTNCG 
TTANACTNAATTNGAA^TWCN^NCC<^ 

AACGGftGCACTATATTTACCCCGTGGTMCyi ,r ICAACCCAGCCATCTCACCCTTGCGAGCAT 
TGGTGCTtiCTC r rrGATACCCTTCATGCTTAACTATCTCATGATCTTTTACATCATTTTCG 
AGTCC^TCTCCA^CGCCin'TttCGGAACTAAGTTGCTTTGCGGATCGCAAO'ta^TACGAGG 
ATTGGTGGAArTGCGTCAGCTTTOATGAGTGGCCACGeAAATGUAACAAGCCTGTGCAAC 
AC^^TTGC'l^CGCCACGTGTACGACTCGAGCATCCfiAfiTCOTTCCACTTGTCCGAAATC 
CAA'IV3CCGCNAATTCCAAACGTPCCTTCCCGG'iX^G , l , CAA , I'GCGl'TCAACGAACCTGGGTG 
AAGAA'TGGGTOGTGACAACGTTAAAGTGCGCCCGGTATC 

Tigure 4 



WO 99/83096 PC1TOS99/US41 

5/17 



TGGACGACAACGCGGGGTCTGATAC^ACTCACTATAGCGAATTTGGCCCTCGAGCAGTAG 
^TTCGGCACGATGGGCACGAGGACTCCATCATGTTCCTCAAGCTTTATTCCTACCGGGAT 
GTCAACCTGTGrfTGCCGCCAGCGAAGGGTCAAGGCCAAAGCTGTCTCTACAGCGAAGAAG 
GTCAGTGGGCSCTGCTCCGAGCAAGCTGTGAGCTATCCAGACAACC'TCiAeCTACCGAGATC 
■rCGATTACTTCATCTTTGCTCCTACTTTGTCTOAI'GAACTCAACTTTCCTCGGTCCCCCC 
GAATACCAGAGCGCTTTCTGCT/iC^CGAGTTCTTGAGA'^GCTCTTrTTTACCCAGCT'PC 
AAGTGGGGCTG ATCCAACAGTGG ATGC TCCCTACTATCCAGAACTC CATUG AAG CCCTTT 
CAAGAGC^'CTGCAGTTrTGGAGACCQCGAGTTCTACAGAGATTGGTGGAATGCTGAGTC 
TGTCACCGAL'TTTfGGCAGAACTGOAA'l'ATCCCCGTGG 

Figur* 5 
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CCATGATGGCTCAf^'TCCCACTGGCCTGGATTGTGGtjCCGATrCTTCCAAGGGAACT 
ATGGCAATGCAGCTGTGTGOG'lVjACftCTCATCATTGGGCAACCGGTGGCTGTCTCAT 
GTATOTCCAC<^rTACTACGTGCTCAACTACGAlGCCCCAGTGGGTCATGAGCTACT 
G CC AAAGOCAGCCCTCCCTAACCTOGGCCTGG AGTTC TGGAGGGGTTCCTGGCTGCC 
TGC ACACTKC TCC TAGTCTGGGAGGCCTCTCTGCCCCT ATGCGCTACTCCTGCTCTT 
GGGGATGGCATTTG 



Figure € 
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aTCTGGTGTCATGGGGACACGGAGGGACTTCCCCTTACCCAC3CACTGGT«TTGGCTR 
AGGTGGGTG CTGAGTCTCAGAGCTTCGCATCG AO ACCAG ACAGGGCTGGG 'f CTGC AA 
GCCTC AGGC'A'GCCG CCC'IG AGCTCGGGCTGGG ACG TGCC C AGAGGTGTTGGG AGG AT 
CTGGGGTGAGTACCCTGTGGCCAGGACTAAAGGGGCTNCACCCTCCTGTCC^TCCCT 
CCCAGAl'CTTGAGCAATGCCCGK3TTATTTCTGGAGAACCTCATCAAGTATGGCATCC 
TGGTGrxACCCCATCCAGGTGGTTTCTCTGTI'CCTGAAGGATCCCTATAGCTGGCCCG 
CCCCATGCCTGGTTATTGCGGOT 

AGCGCCTCGCGGTGGGtGCCCTGACGGAGCAGGGGGGACTGCTGCTGCArGTGGCCA 

ACCTGGC7CACCATTCTGTGTTTC^ 

CTCCAGTGGGCTCCCTGCTGGCGCT^ 

TCTCCTACCGroACCTCAACTL^ATGGT^ 

C AGGG AAGAAGGCCAGC AGTGCTG CTGCCCCGC AC ACCG'1>GAG GTACCCGG AGAATC 
TGACCl'ACCGCGATCTC!TACTACTTCCTCTTCflCCCCCACCTTGTC5CTAGGAfiCTCA 
ACTTTCCCCGC^TCCCCGCATCCGGAAGCGCT'I'TCTGCTGCGACGGA^CCTTGAGA 
'XGCTG TTCTTC7ACCC AGCTCCAGGTGGGGCTGATCCAG C AGTGG ATGG TCCCCACCA 
TCeAGAAeTC^'HBAAGCCC^^^ 

TCCTGAAGCTGGCGG TC CCCAATCAC CTCATCTGGCTC^ATC'TTCTrCTAGTGGCTCT 
T<^ACTCCTGCCTGAA1X3CCGTGGCTGAGCTCATGCAGTTTGGAGACCGGGAGTTCT 
ACCGGGACTGGTGGAACTCCGAGTCTGTCACCTACT'lX^TGGCAGAACTl'GGAACATCC 
CTGUXinCACAAGTGGTGCATCAGACACTTCTACAAGCCCATGCTTCGACGGGGCAGCA 
GCAAGTGGA1K$GCCAGGACAGGGGTGTTCCTGGC^ 

TGGTGAG CGTCC CTCTGCG AATGTTCCGCCTCTCGGCG'rl'C ACGGGCAl'G Al'GG^ TC 
AGA'l^CCCACTGGCCTGGTTCGTGGGCCGCTTTTTC: c agggc aactatggc aacg gag 
CTGTGTGGCTGTCGCTCATCATCGGACAGCCAA'J'AGCCGTGCTGATGrACGTCCACG 
ACTACTACGTGCTrAACTATGAGGCCCCAGCGGCAGAGGCCTGAGCTGCACCTGAGG 

Cf?TC{^GTGCTGGGGATGGGCCTCGGTGCACAGCAlX;C , I^^CTCTGGTCCCAGGGAGG 
CtlX^TCTGCCCCTATGGGGCTCTGTCCTGCACZCCCTCAGCGATCCCGACAOCAGGCC 
AGACAC AGTCTGATGCC AGCTGGGAG'l'CTTGC TG AOCCTGCCCCGGGTOOfiAGGGTCi 

tcaataaagtgctgtccagtcacctcttc^^^ 

ggggtatggccac:acccacaagggcgagtgrcagan^tgtgtggacagctgtcccag 
gaoctgccggggagcagcagctccactgcagcagggcgggca'lxlngccggtaggggga 
gtgcaaggcc!aggcagacgccc:ccattccccacactc:ccgtacc:tagaaaagctcag 

CTCAGGCGTCCTCT 

Figure 7 



8NSDOCID:<WO 9963096A2TI > 



WO 99*30!* FCTVUSW/11541 

8/17 

CACGACTGGGCCGCG ACGTGGTG CGGGCCGAAGCCA'PGGGCG ACCGCGG AGGCGCGGGAA 
GCTCTCGGCGTCGGAGGACCGGCTCGCGGGTTTCCATCCAGGGTGGTAGTGGGCCCATGG 

tagaccjaagaggaggtgcgagac^ccgctctgggccccgacttgggcgccgggggtc^cg 
ctc<^gcgx^cggctccggttccggctccagccc , acacccgggacaaagaccggcagacca 

GCGTGGGCGACGGCCACrGGGAGCl^AGGTGCCATCGTCTGCAAGACT'CTTTGrTCAGCT 

cagacagcggttt , cagcaai , i , accgtggtaox:cu^aattggtgcgtc 

TGAGTAATGCAAGGTTATTTTTAGAGAATCTTATC 

TCCAGGTGGTGTCTCTG TTTC'JXSAAGG ACCCCT AC AGCTGGCCTGCC CCATGC TTGATC A 
TTGCATCCAATArCTTTATTGTCKCTACATTTCAGATTGAGAAGC 

CCCTGACAGAGCAGAO^GGCTGCTGCTACATGTGGTTAACCTGGCCACAAl^ATCTGCT 

rCCCAGCAGCTGTGGCCTTACTGGTTGAGTCTAT 

TGGCATCATACTCCATCATCTTCCTC^^ 
GCCGCCAGCGAAGGGTCAAGGCCAAAGCTGTGTC 

CTGCCCAGAACACTGTAAGCTATCCGGACAACCTGACCTACCGAGATCTCTATTACTTCA 
TCTTTGCTCCTACTTTGTGTTATGAftCTCftACTTTCCTCGATCCCCCCGAATACGAAAGC 
GCTTTCTGCTACGGCGGGl'TCTTGACATGCTCTTTTTCAGGCAGCTTCAAGTGGGGCTGA 
TCCAGCAGTGGATCGTCCCTACTATCCAGAACJTCnATGAAG^CCTTCAAGGACATGGACT 
ATTCACX^AATCATTGAGCGTCTC'riT^GC^ 

TCTTCTTCTATTGGCl^TTCCACTCATGTCTCAATGCTGTGGCAGAGCTCCTGCAGTTTG 

g agaccgcgag ttct acaggg actggtgg a atg ctgftgtctgtcacctacttttggcaga 
actggaatatccccgtgcacaagtvsgtgcatcagacacttgtacaagcctatgctcagac 

TGGGCAGCAACAAATGGATGGCCAGGACTGGGGTeTTTTTGGGGTCAGCC'rrCTTCCATG 

agtacctagtgagcattcccctgaggatgttccgcctctgggcao'tcacagccatc^tgg 
ctcaggtcccactggcctggattgtgaaccgcttcttccaagggaactatggcaatgca^ 
ctgtgtgcctgacactcatc?attgggcaacf7ggtcmctgtgctcatgtal , grccacgact 
actacgtgctcaactatgatgccccagtgggggcctsagctactgccaaaggccagccct 
ccctaacctgggcctggag^rctggagggct'i^ 

TGGGAGGCC'I^TOTGCCCCTATGGGGCCTACTCCTGCTCTTGGGGATGGGACC'reAGTCC 
AGCTGGTATGAGCCAGTGCTGGGAGTCTGTGCTGACCAGGGGCTGAGGATA'FCAATAAAG 
AGCTATCTAAAAAAAAAAAAAAAAAA 
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PCT/US99/1254I 



CAC^CTGCX3CC£CGACGTGGTGCG<3GCCGAA^ 
GCTCTOGGCGTCGGAGGACCGGCTCGCGGGTTTCC^ 

TAGACGAAGAGGAGGTGCGAOACGCCGCrGTGGS3CCCCGACTT^GCGCCGCGGGTCiACG 

cttoggctc^ggctccggttccggctccagccc^ 

gcgtgggccacggccactgggagctgaggtgccatcgtc^ 

cagacagcggtttcagcaattaccgtggtatcct 

TGAGTAArGCAAGGTrATTTTTAGAGAATCTTATCAAGTATGGCATCCTGGTGGATCCCA 
TCCAGGTGGTGTCTCTGT'JTCTGAAG^^ 

TTGCATCCAAPATC'l'TTATTGTGGCTACATT'i'CAGATTGAGAAGCGCCTGTCAGTGGGTG 
CCCTCACAG^GCAGATGOGGCTGCTCCT^ 
TCCCAGCAGCTGTGGCCTrACTGGTPG^ 
TGGCATCAIACTCCATCATCTTCCTCAAGCTTT^ 

GCCGCCAGCG AAGGGTC AAGG CC AAAGCTGTGTCTG C AGGGAAG AAGGTC AGTGQGGCTG 

CTGCC^AGAACACTGTAAGCTATCCGGACAACCTGACCTACCGAGATCTCTAT i rACTTCA 

TCTTTGC'ItCTACTTTGTGTTATOA^ 

GCrTTTCTGCTACGGCGGGTTCTTGAGATGCTCTW 

TCCAGCAGTGGATGCTCGCTACTArCCA 

ATTCACGAATCATTGAGCGTCTCTTA^^ 

TCTTCTTCTATTGGCl^TTCCACTCATGTC 

GAGACC^CGAGTTCTACAGGGACTCGTGGAATGCTGAGTCTC 

ACTGGAATATCCCCGTC<^CAAGTGGTGCATCAGAC^ 

TGGGCAGCAACAAATGGATGGCCAGGACTC^GGTCTTTTTGGCGTCAGCCTTC5TCCATG 
AGTACCTAGTGAGCATTCCCCTGAGGATGTTCCG^^ 
CTCAGGTCCCACTGGCCTGGATTGTGAACC^CTT^ 
CTGTGTG«GTGACACTCATCATTG<^ 

ACTACGrLy3CTCAACTATGAa'GCCCCAGTGG<5GGCCTGAGCTACTGCCAAAGGGCAGCCCT 

CCCTAACCTGGGCCTCGAGTTCTGGAGGGCTTCC'IKS^ 

TGGGAGGCCTCTCTGCCCCTATGGGGCCTACTC 

AGCraGTATGAGCCAGTGCTCGGAGTCTGTCCTGACCAGGGC^ 

AGCTATCTAAAAAAAAAAAAAAAAAA 



Figure 10 
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PCT/U999/1JSM1 



MGDRGGAG SSRRRRTG BR VS 1QGG SGPMVDEEFjVRDAAVGPDLGAGGEiA J>AP APVPAPAH 
TRDKPRQT5VC5DGHWELRCHRLQDSLF SSDSGFSNYRG7.T,NVJC WMLI LSKARLFLBNL I 
KYG11»VDPIQ\A^SLFLKDPYSWPA.PCLIIASNIFIVATFOIEKRLSVGAIjTEI3MQLI^LiHV 

VNLATI TCFPAA VALLVE S I TPVG SLFAL ASY S3 1 PLRL-FB YRDVNLWCRQRR VKAKAVS 
AGKKVSGAAAQNTVSYPDNIiTYRDL^PIPAPl'LCtE^f'PRSPRlRKRFLLRRVLEMLF 

FTQLOVGI-I QOWMVPTI QNSMKFFKDMOT SRII ERLLKLAVPNHDIWL1FFYW1FK SCLN 

AVAEIjLQF^DREFYRDWWI^E!>VT^ 

FIASArrHEYLVSIPLRl^RDWAtfTAMh^^ 

AVLMYVHDYYVLNYDAPVGA 

Flgur* 11 
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PCI7US!WI1541 



MRQQTGRRRHQPSETSNGSLASSRRSSFAQNGNSSRKS5EMRGPCEKVVHT 
AOD SLFSTSSG WTN F RGFFNLS1 L LL V LS NG R V A L E NVtKYG I LITP LQWI STFVE 
HH YS IWSWP NLALELC SN IQILSVFGM EKI L E RG WLQNG FAAVFYTS L V I AHLTI P 
WVTT-THKWKNPLWSVVMMGVYVIEALKF1SYGHVNYWARDARRKITELKTQV 
TD LAKKTC DPKQFWD LKD ELS M HQM AAQYPA NLTLSN I Y YF MAAPTLC Y E FK F 
P RLLRr RKHF L I KRTV E L I FLSFLI AALVQQW VVPTVR NSMKPLSE ME YS RCLE RL 
LKLAJPNHLIWLLFFYTFFHSFLNLIAELLHFADREFYRDFWNAETIGYFWKSWNI 
PV H RFAVRH IYSPMMRNN FSK MS AF FVV FFVSA FF H EYLVSV P LKI FRLWSYYG 
MMGGI PLS I ITD KWR GG RTG N 1 1 V WLSLtV GQPLA I L MYG H DWY I LN FGVSAVQ 
NQTVGI 



Figure 12 



PCTAJS99/I25*t 

WO 99/63096 

1.4 / M 

-nTGATATGTACGGTAAATGGAAAAAAGGTATTCATGTATGGCAAQGTGGTA. 

GAGAAAAATAAGAAAACGACCTGTAATACATGAAAAAWCAATAGGAATTT 
TGAGATAATTTGGCAACATGCAATATAATGATTATAATAAAAAACTTGTCTTA 
AGACTAGAGAACTGCTAATTGAAAAAAAACAAATTGAGATAAATCAAATACC 
AACGGTTTGGTTTTGAACTGCTGAAACACCAAAGTTGAA 

Figure 13 
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TGCAAATG C G TCAACAAACG GG ACG ACG GCG GC GTCAG C CTTC G G NAAAC 
ATCTA ATG GTTCTTTGGCTTCC AGTAGACGCTCCTC ATTTGC ACAAAATG GT 
AATTCGTCAAGGGAAAAGTTCAGAAATGAGAGGACCTTGCGAGAAAGTGGT 
ACATACTGCTCAAGATTCATT GTTTT C GACGA GTTCTGGATGGAOAAATITC 
CGTGGATTCTTCAATTTGTCTATTTTACTTTTGGTACTTTCAAATGGACGCGT 
GGCACTTGAAAATGTGATCAAATATGGTATTTTGATAACACCCCTTCAGTGG 
ATCTCAACGTTTGTTGAGCATCA CTACTCAATTTGG AGC-TGGC GAAATCTTG 
CTCTC ATCCTATG CTC A A A 



Figure 14 
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<12 0> L&xSft&X, Mike 

Rut*iiirtsky , Diane 

<12 0> A<:yl-CoA: Cholesterol Acyl transf «r?so ttt=l*L*<i Mucieic 
A<:i<3 sequences 



<1J 1> 



<153.> 199S-Dfc;-Ob 



<!?□> Pa Lent in ver, 2,0 



<2lG> X 
<211> 

<212> DNA 

<213> Arabidopeis t ha liana 



<JOO> 1 

r:tctcciLgaa Lcctttttcc tttcttcttc 
LLtctataas {jaaccacaca cgnatcc cat 
LccgcLcttt crctctccat tagat t<?tgt- 
ccjstLctDtc tgncgcctci. tt.tctcccgp 
cciattttgGa ttctactggc st tact-ac-sc 
stcttfjataa gcttcgtcirn cggaaatcgo 
c tggt t ccaa t aataa t c ct cct togga tg 
ggaLtgatLc cgttattaac gatuacBCtc 
acrggtggtgg egataataac aatgatsgaa 
atgctacgtt tacgtatcfsa ccsrtcfiattc 
ttagctcega cgcaatcttc aaacagagrc 
+J:rttatLgcr Lgtaaacagt agactcatca 
fccragaaegga LLLctggttt agttcaagat 
gLataLcccrt L Legate ttt cctttgcjctg 
^t&^Akat.^ agaarrtgtL gtcatctttc 
tgtstccagt ttacgttMci: r:t.H»gg^-gl-i5 
tgctcctcac ttgccittgtg taoccaangt 
taagatccct agecaatgea sctgateiftgg 
tgaagagstt ggcat_at.tkr- ntggfc^g^tc 



ttcttc^^r.l. i:agagaaaac ttLacitctc GO 
t.c!<7.c*s«t?gaL ttcttaciett cttccttcaa 120 
tt.w:ti:i-l-tc aatttcttct gcacgcttct 180 
cgc.r.gr.ttcg teaascfjett ttcgnaatgg 240 
tgacggagaa eggtgyegora gagttcatcg 300 
gpcnggatLc LtcLaacgga cttcttctct 3G0 
atgtt.ggagL; Lcccgccgac gttagcaatc 420 
aggga^cflgrj: <:aaLttggcc tigagatsaLa 4&0 
gaggi?gg<?gg agaaggaaga ggaaaccaccg £40 
cagctccitr-g gagggegaga gagaatccac 600 
atEceggpt-*- aLtcaacctc rgtgtacrtag 660 
tcsaaaatct. Krt^gaagtaL ggttggttga 720 
CC5Ctgcg?lBf> fctggc:c:gc:LL tLeatgtgtt 7BD 
ectttaeggt t.gag»A.sLL:g gLacLLcaga B4 0 
ttcatattat t.nt.c#«t:«Ly Htagaggttt !>0D 
ALtctgctLt tttotcoggt Dtc^t+tty* l JbU 
hggtLLtiLta tgctcatact agctat-gsc* in 20 
c^flnr-^^Lga acytctcctac tacgt.tagct ^HRfl 
cc^cattfltg ttat<?3g^r;a flgt.tBtccac 11 



WO 
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Dttctgcfttg tatar.ggwag gy ^Lgggt.gg r: i:<:y Lcaa 1 1 tgcaaaactq Htcatattca 1200 

ccDQCittcat gg*j3tt--t-.rtKrt a^i^^czant, Mt.aLafidtcc La ttcj teaag aacccaaaac 1260 

atcctttgac* ag«jcffat.<:M. i:t.w l.*l.g«:v.M i-.i.gAaagagjt ijttgaagctt tca^Ttccaa 1520 

atttcttatgt gtggotct.g<: «r.tjt.Kc:l-.Mr:i: c«:Kt.cLtcc£ ^cttt<|fft ta aacatattgq 1380 

ca^affcttct ctsrcttcggg g^togtg^at. t-c to* LigciL-ggaat gcaaaaagLtj 146U 

tC3C*f5stt?atta ctijgago^i-.g tLjgwt.^tgr^ r.hghtr:al_A<s, ateiciarcsaLt cgacatatat ihiuo 

acttcccgtg cttgcgc«gc ^agwt«ccei?i «gf*ctf r;tng<r: <raLLatcaLL gctttectag lSGG 

tctctgcacjt ctttc^tg^g ^tcitgcat-CD cagfctnr.ttg tc^LctctLc aaactatgejg l62t> 

cttttcttog ^rcittatE3ttt swsgtgcctt tggtc^t.r:«i. ^acaaactat c La cragg&aa 1630 

GGtttggctc aacggtgggg anwtg^tnt t<;r-,gg*Lt-E:«L crLLcLgcaLL ttcg^acaac 174 0 

cgatgtgtgt gsttcttt-nt. t a oc;*ccj?inc tg.^gruic:<:g aaesaggatca atqtcatciaa Id 00 

acaact q ttc aananatg^r. t-t-tctt-oa^rt cei^ctJihggc; i_- Leg l L gya c ctecgttciat 1360 

gttgtggtasr ttctgatgct suncscg^ca^ atrtgt.ryt.t.flf. haccb llci** ciaayaa<s*g« 1920 

caatLagagt ttft tgtc* teg co. 1902 

<21i)> 2 
<211> 520 
<212-^ L*KT 

<4 00> 2 

Met Ala lie Leu Asp Smr ^lii Oly V*il Thr Thx- V«l TIij- Glu A&n Gly 
15 10 " 

Gly Oly Glu I'ho V^l Asp T.rn A*p krg T.mi Arg Aiy Ai<| Lys £5er Arq 
20 25 JO 

Spy Aj;p Snr Sor A:;ji Gly f..c>^ I^mj :.e?i> Scm Gly Gfer A Bp Asn Asn Serr 
15 ^0 *b 

JTO Sr?r Ar;p VmI Kly A"l PrrvAlfl A:^> Val Aret Asp Ar<| lit Asp 

^0 " «>U 

£er V 3 1 A:;ti Aiigj Asp Al* CI n f:ly T*n- Ales A£Jn Leu Ala Gly ASP 

£5 70 7S 60 

Asn Asn UlY (flv Olv ABP ASH Asn Gly Oly Gly Arg fi.ly Gly Gly Glu 
85 90 9* 

Gly Arg Gly A Em Ala Abp Ala Thr Phe Thr *JVr £Tg Pr<i Si>x- V«l Pjt« 
100 105 110 

Ala jJie Ar<x Ar<? Ala Arg Glu Sex Pro Leu tier t;er Aep Al« Hp I J be 
115 12U 125 

Lyi; GL/i Stfi- Uis A Lei Gly Lisu PJm A; in l^u Cyl; Val Val Val Leu Ilfi 
l.SQ 13F> 140 



'2 
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Ala Val Aen iJer Ars He ile <ilu Asn L.eu "i.ys Tyr Cly Trp 

150 15S 160 

Leu Ile Ara Thx Asp l>he Trp l*e ser i*r Arc $*r l-ou Arg ** P Trp 
165 115 

Pro Leu Phe »et Cys Cys He Der Leu Sex He l>he I'ro Lev Al.fl Ala 
180 1*5 ISO 

Phe Thr Val Glu Lys Leu val Leu Gin lvb Tyr He Ssr Clxi Pro Val 
1$5 200 205 

val lie Phe Leu Bis lie He Ile Thr Met Thr Gl u Vol Lexi Tyr pro 
210 3 15 220 

Val Tyr Val rhr Leu Ar?j Cys Asp Ser Ala Phe Leu S*r <Jly Vol Thr 
22S 230 235 240 

Leu Met Leu Leu Thr Cys ile- Val Trio Leu Lys Lew val Ser Tyr Ala 
245. 250 255 

His Thr ser -JVr Asp .lie Arg- Ser Le-u Ala Abh Ala Aln Abp Lvs Ala 
260 i6B 270 

ABn Pro Glu Val Ser Tyr 'iyr val S>er Leu Lys- Ser Leu A1p Tyr I'he 
27& 260 2&-> 

Met Val Ala Pro Thr Leu CVs Tyr <Sl« Ser lyr l'ro Axs S*r Ala 

5*3S i0 ° 

ry I l e Arg Lys Gly Trp val Ala Arg Gin Phe Ala Lye Leu Val Lie 
3DS 31° i2 ° 

Phe TJir Gly Piie »« Gly Phe He He Glu Gin Tyr Ile nsa Pro rl* 

525 33° 3J»S 

Val Arg Ann K«.zr Lyb Hi is Prci Ltsu Ly:: Gly A£ip Leu Leo Tyr Ala lie 
14D 34S 3.5 U 

Gly Arg Val Leu T y! -. Leu Scr VdJ P™ ixw Tyr V«l Trp Leu fTyw 
■355 3$0 56S 

Met Phe Tyr Cys Phi? Ph© His Leu Trp Lew Asn Il.o r..;u Al. fl <Mv r.cv 
370 3?5 ™° 

Leu Cys Phe Gly Asp Ar?r Glu Phe Tyr Lys Asp Ttp Trp Asn Ala LyB 

390 33!» 
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Sc-r Vfll C3y A:-.p Tyi- Ti p Arg Hr^r. O'rp A^n MesI: Pru Vfil lti3 Ly£ Pxp 

4LD lib 

Hcit Ajxj W.-iri lit* Tyr P]j^ Pn> Cy=i L«u Ary LyEJ lie Pro Lyii 

430 425 430 

Ttir Leu Al^ T] & n« lie: Al* Ph* Vffl Bisr Alu Val PH* Gl.u 

43? 440 44h 

C^u Cy& II*? Aid VrtD Pro Cy* Am Leu Ph«J F.y* Lrsu Trp Alii Pint L<™ 
450 

Cly II b Met Fh* OJti V*n Pro L,eu Votl Phs He Hit A*ti Tyr r..i»j Gin 

46? 470 -H5 JRO 

01" Arg Pho OTy fl«r Thr: U«] O^y A^n 'H^t T l.c: Fbc: Tvrp Ph« Ilci Ph«? 

4Rfi 490 

Cyp Jlisk Ph<? Oly P. In T'tq Mot Cy& Val f^M l^V Tyr Tyr tti.-s Asp 1*<3V 

?HOO 50f. Fvl<J 

n*t A&o Arg Ly* Uly $*t; Met ser 
^15 520 



<2*0> 3 

<211> 214 

<212> DMA 

<213> <3J.y<:ijLPt jnnx 

<4D0> 1 

gCda-gfrt LCa tigai^cLL^gc: t:*-.r;.ct,y gfctgr:t:[:<:l:« n<:/s LLal.L^I.I. ^ C:L^m^^c:aMri GO 

CtatCCtCCic: aeaccLL<Jl.a LL<:g«G3ggg t:t:g[jc:f.ghH:_ L:y<:Ca*jCl_Lo Le&tiC-Lgrtt.H 1.30 

•atatt £aC£Q QbgLL&LQyy Mhl'.hat>9t»tP g?9r:9nK/9i:d I.L«al:i:L: L:a,L Lt^l-^^/aart m 1: l.JJO 

Iracragc&Lc LrteLC&a^yy naia^rr.t.f.t.f. t.nngnLZtt teg ag^gagiiKLil. y<a*±y 2.3 J 

<2ia> 4 

*212> hna 

ctuct.ttt.gt ntctggtgtc acqctqatqc tattadcf tr? catt-atDtgg ttnnaaf.C gir 60 

tgtc-Ptntgc ncatac^aac tataatatga jTaqcactcnc tgttrcsaot gcuinaagfTnEr 12i) 

aaacattacc caatactttq atatgggagta tcccitacact gtgaccttca sroragtLt^gc lBO 

a t act t cats Bttcjctccta catcalled. a l.craeiacaagc' tatcctccica caccttcacrt 24 0 

tcgaaagggt tgggtgtttc gLcaacrl. 2t>7 
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*211> 2^ 
<:212^ r«A 
<213=> Glycino max 



gtggaatcjcc nzLZiactgl'.l.g wgnttftttg 

gntgntccgc caci?t.*t-.fthh r.c.r.^t.gt.tt 

tttaattgcc rvC^tggH.I:<: l.g n ttt* etc 

catat-t-CRflg tTtgt.gygliLL cnijng^^U 

nttmt-nggc np9i*n»»l-l"-*- <:Tmg**C<?cr: 



gagy^tytgy **i:*i-y^Lt| m.^rt<;tiyHt.y fiD 
3aggr.jic:ggt: ftl-.rtc:t:»iaagy c:<:y r.tgr^1\r:t. 120 
i^Vt-g^gczhgt. g<:«hiry<:L£L L<;<:ttgr.c:^rt 3RD 
nag h r*. r.r:«gg h.u<:<: 1- l-g££ 1 1. L<:na r?Eumfl 24 n 

275 

ggggg 



<210* 6 

^211> 2*7 

^2JL2N DMA 

^ 5 3 3 > L': 1 yc: jnax 



»Hr; S g»*tl-.g H gi«:l.<:c:«g« gM«h«hgcw t: g t-« tt * « t. » * t: l:g t.. ma^A 60 

ggr:i:tf.Kgg« H«i««:l-gg«« tgct.t.cr.tt.r. MCJiaghggc hfcg1:g*ggfc« h« h* t a M tt- 1^0 

cctot-t.gggg gat<:t.*«g«fl muigctacw wCgtgtggg ttgtt.ntr-wt: n r, K c gt-tgcn 1.P0 

p^ggcAhg attl:«g«gl-.g gwitgottM-t fccvifcgggnkfc gg1-.t.gac:g1:g LI-.h*t.tc:t.l-.<: 240 

ntrx^tgftgK tygl.r.l.l. 



<210> 7 
<212> 253 
<212> DNA 
<213> mays 

<dOO> 7 



agaaaaLgg* ana tgc<: L*, t gc£Lt.«ml.gg a vt 9 l L * & Le atiLdiattU LcciL^i.g faO 

C :ga*fiLggL* uL^ggn »yU:gr.fcyt.h UitA Lfl i-tg t tcLLgctlLcL 9 u<.y L*<:l.r.<: 220 

aLgaejLLaLg 1. $ L Lc,C Ug U. to:t.gr:i:M<:« U; A a* L L CLCiggtLLLL i.-^ngya*!.*: 1H0 

atfftttflfiW tLGMfitCdl. L *L**.1:tf*<:« LMLfiL-tLca MMtaMLt. Cdyl.^ci»» 24D 

* 253 

atgggttflgca ata 



«:21U> H 
<211> 2 50 

<2l2> Z<ea mays 



<400> U 

L^aagta LijCj f:LLesH.fltil.i* »g« t^:r-gg^:*- 
gc3f^£3£:L:yL , :1: ««hgt:y1: t.yc: cttagt-Ct^n 
aaaegttggc ftttcaa^acit ct^attagtsi 
ttacaaratt tgaaat-t rjto. totccaatsrc 
tat eaglet t tgtg 



I: I: Kg43-hl-L.au Lg^LnLlAL^ii 1.1.gc:g«^a<:h fiO 
nc.nC5t.i-.hr-r: c:c:t.t.gy Lgcu t: ttgcrtgh^g l?.fl 
atcctcctnc tac.c-tgt.ttt cacatccttt 180 
tcacsjnttct tansrtstgat tctgcagttt 240 
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<'2l\* 262 
<2V2> DMA 
<2l'3> z&a trays 



<4DD> a 

^i^tatgqrc tta^taataa gatctgtjctt ztcgtt ta^t- grtur:* i-.<:«i. i.ynyagcictg CO 

erccacrgcta atciifittgcc C t □ t?t c t-ac c catatttccc cf-.f.ggt-.g<:«i. iM.ynagtcgft 120 

aaagLtggca ttcaacaatc tc-Dttaqtsra tcctgctaot o»crAr.rjr.r.r;t:.r; rtr.wtDctttt 180 

Lacaacattt iiaaattQtat atccacitaci est cat test. t. o*?4jt:gi-.gtf tr. <:':.y^agttt t 240 

±cag<fctttq tcrttnatgtt to 262 

<211> 325 
<.2"L?> DMA 



<4H0> 10 

L««LL:rtfc±ii£t: tcgi^rncncKi ^tcaeiccgta tnccatgaqa catqtaotrrc smrGCCfTt^c ^0 

i.-acacarktca Jiacctnacjca ^nncneiggat catnsi^tcao ciraccgntnrs nnttcttgac 120 

rtagathiL^Lcj tjatgctacci t rcaagtci l^l aatg^tggqc aac^Lgaran tg^ntcin ISO 

<:«gi:tiliiiy Lc <r<facac3cc?i<t t cftniuru^nrja tctacactac cacgacgica tffiiiicasrcrca 240 

yyi^iragge* agtagataqi ncqcicagaga ca tat acttc nacatcgnnc ntcagncmca 300 

TLuejLgagcgt* -gcpfffCeirKjcio rccacic 325 



<2ia> 11 

<211> SIS 

<21'2> DMA 

<2113> Morti&reilla alpine* 



<&QQ> 11 

gagnnnngna acgttta-CTCC t n-c cqi scjc c qccaaaatcc aaflflijncncic cn^ctnog 00 

Ltanaccnaa ttiwaaatn cnniK'ccaac ttnaqanact trnionncccc ocnacttrjcic i20 

aa^ggaojcac tatatttacc ccgtcg^Lngt tcaacccagc cacctcaccc ttgcg agent 160 

-LggtgcLcfcc crtctatacec ttcaTicsctLa actatctcat ciatcttttac- atcattttcg 240 

uyttjcaLcLg n-aacgcrcttt qc</gaacLaa gttqctttgc cmatCBcaa-c ttttacsaeo 300 

ntt-gytygisfl trLcjcs t cage LLLgal.yay 1. yeigcitgcaa a c tfcraacaa-q- cc t<jcgcaac 360 

ncukcliLgcL ccctccacgtci Laegft^Lcga gcatccgagt ccttccactt gtcc<iaaacc 420 

O^nLtgc^gt- - -" tt^LL^eaarit: y L?.c:<:l:t.c:riE: gy h<;y tiCuaL gc-gt LC£££<r aac-ccg^gtct 4 SO 

O&gBiial-.ggg:- yyLgacaaCy L.'..rtfi/fly L:yr:y cX;<;gy C.rt \.d SlU 



^211> 51R 



^400> 12 

tggaggaccu^ *;gn;g-gggtct gntacijactc ti ctcitngggsi .ihn.ggr^r:t. i:y ny^Agl.ay 60 
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attcaacacg ^.ggcjcacga gg^jtueatc atgr.tutftca agc-ti-.Lattc ctaccffggat 12G 

eitcaacctgt ggl-gccciccLi gcg«Hgc*gtc aaggi:i:aaaq ctgi-.<:LCtac agg?angaag ISO 

^tcagtggg* ^ctgcgan cMgcrlrgteia gctai-ceaeia cw^^acc tatfccagotr. 240 

ccgatta&tt cAtetttQTGt. <=<:fcnt:tttqt gt-t.»Lgaact Oi»«cCttCCt cgatccwr.r: 301) 

gaatacsaga g<:gctttctg cte^g^gag ttct.r.g«(at gctrl-Ltttt acccagcttc 360 

aagtsscGct g«i.L'caacag tggsl-.gg1.ee etact-icea gan*i:<r:<=atq gaagcccttt 420 

eaagagcttc W :ngtttta D W^ffi=ya gL Cepacia fintt.ggl.£ECfa atqct irate 
tqtcaccgac tt1:1:ggcaga nctgga*t-fi£ ccc-cgt-gg ^ lf * 



<2lU> 13 

<22l> 299 

<2l'A-- DttA 

•c2l3> murine 



<4DD> 13. 

ccataacOTi? r.tiaggtccca ctgg^yga ttgtgggecg attest.™ aa gggaactatn GO 

geaatneagr. t.yLgtggBtg acncw:iai:ca ttgggmftcc ti*Ttgg<r.hgi-e iccatqtarfft L20 

ccacgact-ac hat^rssctca ^^til:^' ccMgt-gycit catg*»grrl:*c: tgccaaasgc 1B0 

aoccct.ccct. M«tctqggcc t.gg«gi-Letei gangggl-Ucc tagci-.gc:c;L«, cacactcctc 340 

ctac?tctggg »gi>cctctPt 5 ww:tAt<fc CT ct!*<?tc:crg ctctt-gggga Lqgeatttg 299 



*211> 1895 
^212-.- UNA 
*"213> Kviitoti 

*4DD> 14 

gtctOTt-gt.g ftuggggacag gg^gggnc-Lt 

tggsTtgctgrc gh<: Leakage ttgg&sugg* 

ggct^ccGcc ctgfeic»ctccrcr Bct-ggg*<:gU 

gtaccctg tg gr:i:ayg£scca aa^TDgg*" t " riLr 

gcaataccca E3f.i--«l .LtctQ cjaaaacci-L-a 

aggLgyttic tctgi-v.crcrt.ct aaaaatrcne:!. 

c: gg<:raatCft cttfcgcH-c»l:g gctflcnt-t.™- 

tgacrggarsca ggeggga^-g etyctgeavg 

czagcggctcjt ggtcttrtt:Lg gttBagtCT;* 

t.ggc:£cscsc catcctctt.c: cucaaactot- 

gc*gggc<;ag ggecaagget*- gcctccRcag 

acacegtgag ctacQnggMc aatctfjacct: 

ql: & ret tgtg r ta cg»g<; I- ^ aac t tt cc c r; 

fcgegaeggat ccttg*g&*-g ctgttct t.^w 
ggatctgLcc-c cacwUr:My ^actccntg?* 

tCAl-Cg^gcg CCtCCtAJ^wg <:*.CjgC<rC3tCC 
f.gg ctCtL CCac tCOt-gc: r:t_y& at CJCCg 

aaetctnccg ggactcjgtgg aact-ccg^gt 
tccctgtcjca caagt.ggt.gc atcagacnul- 
gcaagtgcaat ggc^pygata ggggtgtt.r:t: 
cgagcotccc totg^gaatci ttcc{rcct.c:i- 



cccctticcc agcncf.gyug rtqactgngg fifl 
aacc^g^ccsg fffjctgg , gt-.<:L ^caa-cjcctga J 2n 
<rccc!»gag^r QLtggg^ggw L^tgggcstcjo IRQ 
accct-cui-tn. ccatccct.r:u eagatcttga 340 
Lcaatji;* 1 carcctg{jtg y acrccca t c c 300 
ctca-?TCt.gg<:t: rgccccatg^ nfcycjLLatrff 3G3 
&g<TttG^g«« c^cgcccngcg gt.gggcgc-cc 420 
v^QCcaa^rit- c^ckccaccat^. ct-gtr-y It tec 460 
Lc-actccpgt. gggrtccctg ctggi^ctqa D40 
hcL-ccraccg i^acrgtcaac tc?at.ggLgcc 600 
ggaagaaege r^icjeagtget {jotg^i^gc 660 
ticcgcQctK^t. trLacLacttc ot-nKt.c^cac 720 
gctctcccng c:atccgffaag cgvl-t-lictgc 7G0 
ivccaqc-rcr.-.a ggtgg^gctg otr-r-uycacit 840 
agcccttcft^ ggacratcjaac tact^Acgca 900 
ci:datcacct r:hiLc:Lggctc atctl*.<:l:Li^: 9 CO 
tggctgaqcc r.»1:g*:agtt:t PTffag^cnggLt 1020 
ctgt^rttfr^ta cttctgg^wg a^ctcigaaca 1D»D 
tctac^pgc:c: catqettega ciggy gcacica 1140 
Lcicicctsgg<: fJttcttccac g«g^acctga 1200 
cisifjcgttcrtt: gggcatgritg gnr.tagatic c 13C0 
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-cactggcctg gt.*.L:ytgggc CSICtttttCC affJISCaQCtv Cggr-flargtra tjc-LyLgLggc 1.S2U 

tctCQCtcat ^nk-.c:g^acaci ccoatagccCT tcctcotgt^ cgtc<:/ii:y^: ractacgt-gt: L2G0 

tcaactatga {jgoc^agctl cicaaacj^c-cc ga^ct gcncc t.g^gggr:c:i.g ycttctcafcL 1440 

qccacctcac acirv-a^tgcc aaasjcccacc tctcct octet {jg^r.ngrayL gctggggaLe> lb 00 

gcicctcigctg csnMg^Lcc tcctctggtc eeaciggacsac ctctcr.gccc E:l:AtggLjgt:i'. 

ototcctaca ccoct<:»yg<f aiffgcaacag cacjfTccaBac acag t ctg* y<:£agrLgyg lb 2d 

aCTtcttgct^g sr^ctgcccc gggtccciagg gtatcaataa og tgMghcc: AyLgacrcLoL I6iiu 

tcagccfcgsc aggggcjcligg fmcctaatgg gaaatatggc om^/icm Agggcrgs^tg IV du 

ccagagctgt gt.ge^agct atcccaggac ctcrccsrgaga gcagr^jy^u- eaccgr:ag<:a lUOO 

flsracaOT^at gg<:<:ggtagg s8aaQt<rcaa gci ccaasca<sr c?cgi;i?ccr.jjt. Locccrac&cjL latio 

cccpfcacc-ta gjiiftisgecca cctcogacgt. cctct ia9i> 

*210> 15 
<211> 17G6 
«212> UNA 
*213> wattus 

*400> 15 

cacBactggg wgriy MLVC^tg cjcgcggciccg aaciccatffaa ccaaccgogg^ ggc:g*:ggtj»«. bD 

BCtctcggcg -rgyu^yacc ^flctcqc<igg tttccatcca Qggtg!it*>gt. gyyc:<:r:n1:gy 12D 

tacjacgaaga ggngyiigcga <jacqccgctg tgg^ccccfia ctt^ETDcgct? gggygi:g*t:g 1RH 

ctccggctc-c ggemeegg 1 1 ccgqctccag cccacacccir ag^cwv>g-9C t:gg<:Bg»i:<:« 2 4D 

gcgtgggcg/a c:ygceacLag Qagctaaggt ciccatcatct gcnzirj^ctct- hKgLl^»g<jL 3 00 

cagacagegy Lttcagcaa t taccatagta tcctgaatto g trjcgtgg » y AL.gctgatcc J 60 

tgagtflfltgi: aaggttattt ttagagaatc tratcaagtP tggt.:*i-.r r:r.$ gcg-gatccca 420 

tccaggtggi: ejLctctcrttt ctqoasjBacc cctacag-otg gcr.-.tgcciric:/* Lgcrtgatca 4B0 

ttgcflt^i>un Lstctttatt fltaactacat ttcagactges gn»g<:gr:c:i:g Lcagtggcjcg 540 

r-r?nfcgt±Laya gcacia tsmoa ctgctsct^c argtggt-t^ t:c:r-.ggi:i.-acci attatccqct 600 

tccoagRagt; LgLtjgcct ta ctsrnttaagt ctatcactcc 3gt ggut.i-.<:<: eLgtLLgcrrc b60 

tagcat^ahA <:Lccatcatc ttcctcaaac ttttrtccta ocgggt«i-.yi.<- aateLgLggt V20 

gocgs;i7»g<:y a^gggtcaag accasacictg tfltctsreagg gnjigawgaM: ayLgg^gc-LQ VBD 

ctgoccrag/ifi u^crgtaacic tatcccjtiaca ac<rt gacc t n ccgngntcL.t: La L tact tea 8^ 0 

tct.tt.g<:r--c^ tactttatg-t tatsaactca actttcctcg flC,r;r:rn:<;<:gn aLa^gaaagc !>D0 

GCtttc=t:.g<-l- arggcggcitc cttgaeiarcfc tctttttcac ccagr.t.i-.r«n yLgg^g^Lga 060 

tccrtgc^gKg ynLygLccct actaLccraoia actccatsran. gecnttcflog gaeatgj^tt^L IDiiO 

attccacgn^t truliLgsgcgt ctcttaaagc tggcgatccc canecatctig * l:* Lta 1UB0 

tcttcttctn ttg5<=tLL\.c LracLtaLgtc: tc-aatgetqt fTffcagagoto cri-.grng+'.t.hg li^a 

gagaccgc^jn gttcit'.rt^wyy ya<;L^ytigg« tttgctgagtc tstcocctac tt.ttsgc^g« 3 

actaa^^tflt ccccgtgr^nc: H.^gt.ijgt-g^(? t-t:«gacactt ctacaafTcct Qr^jotc^gflc- 

tgcifl ca-gcaa CCSC^a tggaf.g gcic:^ ggttcr^g gggtu^-LLLL gqccitcagcc ttcttcotg 1330 

agcacctnfft gag^fltttitrt: crhgngg^hgh t.<:<:gt;crtctg ggcaCtcisca {jccfttgatgg )?S0 

titcaggtccc actg*jr?rit:gg flhtgtgwoc g<7 1: tc: t-.tctra aci^aaccat gecotatqeog 14^0 

crtgtgt^RRt gacQCtr;^tc fl^t-gggcaTic eggtg gf: l:g t gcccatgLat atccac?TLict 1500 

actaegtget -raact?it.grtr. g^occcgtgg gggr^Rr.gagt: L^Lrlgccaaa ^ccaqccct 

c:c^taaccLa a^cctggcigt tctsa^^rDcac ttcctggnr.g c:i:t.ijc:acact cctcctaqrtc 1G20 

tg^gagQCct ctct^cccct ntfscaaaccta ctcctgct-ct. t-.gggg«i*.ggL , r acct^agtcc lb60 
^geLggtatg agecagtget nROaatctcit <rctcaaccngg vs^^g^u^ 11 aLcaataaag 17^0 

Agcb^Lctaa aaaas^iiQ aactaaa 1766 
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^2tU> 16 
<2ll> lV6b 
<J12> DMA 



<dUO> 16 

tracrgactggg ccac?rac?gtg gr.gcgggc^g flsyccatsao cgaccgr-gg* gyirgcggg** bD 

gctctcggcfj tcgaajrerncc gg^ungciggg KLLccatcca srsatgg C» e f- gggcceaCgg L20 

tagacgaaga aaagsrtgcgtf gai^ngishg ugggc cccqa cttggg*s'=<- gggggtgairg LfiO 

ctccggctcc ggctccDBtt. r.r.vgrztvczny <:c:<eraeacccfi UGacattagpx: i:g£Cdgac£a 240 

gr;gtgggcga cyaccactgg gauct-g^ggt g«:iratcgtct scaagact-cb i.i.^l.l.i?flyi:l. :*UQ 

cagaeagcgg ttccaccaat. wwgtgg^ tr.nttgaatttf Btsjcgtggir.g nfcy<:by*t-.c:<: 360 

tgagtaatgc aaggttattt. t-tag^gfltftr; Mratcaagta tagcatcctg yhggdl.t^A &20 

Lecaggtggt eitctctgttt r^tg^gg^^ n: La eager t(? acct gcccc* i-.g<:t>-g»fc<:.H 

ttgcatrcaa tatctttart- gt.gg<:Kai;Hr. Ll.esgattga gaaccgcctg r.<:agt.eggKg 540 

ccctcracaga STcngatgggs i:f-yt:tyi--tfciL: t± Lg L^rgt t cctggccjar.w «LLdieLgCL 600 

tcccaacacc tgtggcctt* chygLLyuy I. ^tatcactcc ag t egg S t r.c: <:Lg1-LLg£Le 66 D 

tsrciDatcata ctccat^ar.c U-£:ctt:aage LLttctccta ccgggatg**: **aLi_-LgLggt 72U 

gccgccagc-a aaoggt-^a^g g<^*i««ft=Lg Lglictgcacja gaetgaagst-^ ayfcyggy<:*.y 7feD 

rtgcccagaa cact?jtn?iG^ tfltr^ggt^fl rt <:i;tgaccta ccgagatot-^ tw1ifca<:l.l-i:» R4D 

tCtttffCtCC tactttEJtgb h;*t-_gH»<;h<:a rtC <_t t CC LCtf atCCCCCCg.? wt.»<:g*rt«ijt: ODD 

get ttc tact acgsrcDDDt-t i:tt-i*»ysi.gi: u^tttttcac ccagcttc«tf g i.ygs^i-y* 3faa 
tccaocaats aatggtccct fli?fc»*.t:<-:fly« MCtccataaa g<?DDtt.c*eg g»c:«LgyatL 1U20 
attcacgaat cattBagcgt r-ti:H:***y^ t.ggcggtccc caacratctg al.ALgg^lica lOSO 

tCtLCttOta ttgsCt-t.Ct:r. r.ai:hcflbgti: Lcaattl CTtRt g<J*ag?>g^l-. r: crLgcaglt Lg 1140 

aaaaccacca gttotn^fl^g yhKil.ggLgija atqctsTaGtc t.gtcoc<: L--« l: ULLtggcaga 1200 

actggaatat cccogtgrnt: *»yt:ggtgc.& Lcacracactt ra™gt:iH. .sLgcteagac 1260 

tqagcagcaa cnMtgg* tg g<:<:ayg*i:i-g gggccttttt goegtcsgr-t: LLcttccatq 1320 

afltacctagt grigcatt^^ <:hg«ygtil-gL uocgcctctff gg^hf-r.nc:a gcratgatgg 13 BO 

cccaggtccc nctggnr^.gg al.L.y LgaACC gcttcttccn «ggg«Br:taL ggraatgcaqr 14-10 

ctgtgtgggt wcac>t:^»f.<: ai-.t-.yyy^aai: ^ggtgactgt gctr-af-gi^L ^ LLreacgart l!i00 

actaegtget cc*a<?rat-g«K gi:i:t:<tAgLgg .gggcctciagc tac?T.gw««« g$ccagcrct 1560 

ccctaacctu gg<;Dt.sg?*gr. t.rM.ggwyggc: ttectgsrctf? cctgcwcwfc c:<:U:oLagtc L620 

tce£2agg<?ct ctetgec^t n t-.gggg<rL:t_ti cLcctgctct tacsortf-ggr. A<:c:LgagtCt: 1696 

agctggtata aacc^gtgc^ ggg«gt.i:i-i*L gctgac-caaa srgotgrtgg^t. *kt:a*taaag I7d0 
^gctatctaa aaaaaaeiaaa tiftM^fl 1 



^•210> 17 

^•211> SOD 

':2l2> PRT 

<213> Kattus bp- 



<400> 17 

MftL Gly Aep Arg Gly tily Ala <*ly £Jsr Arg Arg Arg Arg Thr Gly 

lb 10 IS 



£«C ^rg Ski:- Iifr Gin Gly Gly i)er Gly Pro M«L Vdl Asp Glu <Jlti 
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20 2D sa 

Glu Val Ar(J A BP Ala Ala Vnl Gly Pro A5P Uu> Cly Aid Gly Gly A:;j> 
35 -40 

Ala Pro Ala Pro Ala P^o Val Para- Ala Pro Ala J^i* TTir Ai^g Aiip f.ys 
50 

Asp Arg GLn T*hr Ser Val Gly Asp Gly Hia Tr[> Glu F.r.*u Ai-y C.y* Ki& 

65 70 7* SO 

Arg Leu Gin Aap Ser Leu Phe Ser Ser Asp Sgt <:ly Ph<* <3nx Ajsh Tyr 
£5 90 P5 

Arcs Gly lie Leu Asn Trp eye Val Val Met Leu 11*% Leo f>u*r Aij-n Ale* 
100 10b "NO 

A^q Leu Phe Lea Glu Acn Leu lie Lys Tyr Gly lie Lou Vol Asp rro 
lib 120 125 

7.1 GIji Vul Val Ser Leu PJie Le=u Lys Asp Pro Tyr Ser Trp t*ro Ala 
13D 1SS 140 

Pro- Cye Leu lie lie Ala 5er Asn lie Phe lie Vnl Ala Thr J a he Lin 
14b ISO 15S 1 60 

lie Glu Lyes Arcj Leu Ser Val Gly Ala Leu Thr y.lu <LOr> Myh Oly Ijtii 
16D 170 17* 

Leu Leu His Val Val Asn Leu Ala Tbr lie lie Cye J a he L'r^ Alo 
1BU 14i5 190 

Val Ala Leu Leu Val Glu Ser He T-rsr Pro Val Gly Z~(?r l^u Mh^ Aln 
1SS 200 205 

Leu Ala Ser Tyr Ser lie lie Phe Leu Lys Leu Phe Ser 'jyr Arg A*p 
210 215 220 

Vl*l A::u Lc?ii Tx-p Cy:: Aixj <^Jti Arg Ax/g VAl Lye Ala Lye Ala Val Ser 
325 230 J J 5 240 

Alft Gly Lys Ly* V»l S-or 01y >Jf Al^ cn.ti At>n TJix Val Ser Tyr 

245 250 

Pro Asp Aba Leu U'hr 'iyr Ar<r Aep Leu Tyr Tyr The lie I J bs; A.1 a I J r^ 
260 26b 270 

Thx Leu Cys Tyr CjIu hen Aei^ PJie Pro Art? £er l J ro Arg l.lc? h.rg Lys 

10 
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275 2130 2Si; 

Ai-fr Pbe Leu ]^*vi Ax y A^g Val Leu Glu Met l^eu Phe rbe Vi'hr tUn Leu 
290 29b 300 

Gin val Gly i^eu Tin Gin Glai Tx-ji Wc=L Val P*o Thr lie Gin Asn tier 
305 31D J15 320 

Met Lys Fro Phe* l.y« A:ip Mfjl. Aip Tyc Ser Arg lie He Glu Arg Leu 
335 330 335 

Leu Lys Leu Ala Vw 1. Pru Asxj Kis: i.czu lit- Trp Leu lie Phe Phe Tyr 
340 345 350 

Trp Leu Phe Hi* Scr {tyr. l<*ij A%n Ai.n Vul M& Glu Leu Leu Gin Phe 
355 36P 36b 

Gly A3£> Arg Clu Ph<: Tyx- Aihj A=sp Tip Tip Asn Al<3 Glu j;er Val Thr 
370 37? .3IJU 

Tyr Phe Txp Gin A*n Trp Asr, J"l T>ro U»l H.ii; Lys Tip Cyft Ilii Ar^f 
"5 39<? 395. fl U0 

HiB Phe Tyr Lys Pro M«h T.rw Arcr J. £ u> Gly Sex. 1 Asn Ly* Trp Met Ala 
405 4LD 41& 

Arg 'JTir Gly V*l pjut L*-u Ala St-x- Aids Phe phe His Glu Tyr Leu vol 
420 425 430 

tier lie Pro bsu Ary Wnl. Phcs Axg ix*u Trp Ala Phe Thr Ala Met wet 
435 idi 

Ala Gin Vfll Pro >.nu A1h Tri> Tl« V«l AsSfj Ax*<r Phe Pile Gin Glv Asia 
450 455 *b0 

Tyr <51y Asn Ala Al»-Va1 Trp V«j Tin- L«u He He Gly Gin Pro Val 

470 4 60 

Ala Val Len >fet. TV 7 " v »l Hnr> A:;p Tyr Tyr Val Leu Asn tyr Asp Ala 
4P5 49U 49b 

Pro Val Gly Ala 
GOO 



<21D> 1H 
<211> 4ii6 
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<'213> Caencoctaabdit is 1 gans 
•::JOI)> 1H 

fife I. JVrg Glrj Gin Thr Gly Anj &rg Arg Ax*g Gin Pro £rvr Glu Thr Sex; 
IS 10 IB 

Asn Gly Rctc Li±u Ala Uex Ser Arg Arrj Sc-.z E&r Phe Ala COn A:m Gly 
20 25 

fcsn Sriir S«.r Axg Lys Ser Ser Ulu riot. Ai^ Gly Pro Uys Olv r..y* Vnl 
35 40 45 

Val F3i* T»n- Ala Gin Asp Ser Leu L'he Sisx- TJu* £3er Ser Gly Trp Thr 
50 55 i0 

A.sci Piio Ar^ Gly Phe Phe Aan Leu Ser 7 1 a Leu Leu Leu Vsl Leu Sor 
65 70 "?b eo 

A::n Gly Ax/g Val Ala- Leu Glu Asn \fa 1 Tie Lys Tyr <Jly IAo T*nj 71o 

00 9* 

Thi- Fj.-o Lciu Gliv Trp lie Ser Thr i'h« Vnl Glu isis his Tyr £*r II* 
10O 105 J.m 

Trp Sex? Trp Pro Aen Leu Ala Leu 11^ Lku Cys Sex Asn Ho ftlji 1*1 h 
lib 120 125 

t.ctm £ur V*l Phe Gly Met Glu Lys lie T^.n Glu Arg Uly Trp l^u <:iy 
1.30 13B 1*0 

Asn Cly Alri Al* Val Phe Tyr Thr So* l.c*u Val lie Ala His Leu 

345 1SCI 1C0 

Thr J"U? Pn> VmI Val Val PJir Leu "Thr Kis T.yr; Tip Lys Aen Pro Leu 
IfaS 170 175 

Trp tier Val Wet M^l; ttly Val Tyr Val lie Clu Ala Leu Lye Pbe 

180 l^i> 1 9 ° 

lie yer Tyr Cly Hi* Vol Tyr T*'P Ala Arg Ar?.p Ala Ax g Arg Lye 

1^5 200 205 

lie Thr Glu Leu Lys Thr Gin Thr A^p Leu Ala Lyr> ■ l.yw Thr- Cyt 

210 21S 220 

Asp Pro Lya Gin Phe Trp Asp Leu Lys Asp <^lu Leu Cer net Kis Cln 
225 23U 2." 240 
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HeL Ala Aids Gin Tyr Pro Ala Aaxi L«u Tin- Leu. Ser Asn lie 'lVr Tyr 
245 250 2D5 

Phe Met Ala Ala Pro Thr Leu Cys Tyr Glu Phe Lys Phe rro Arc Leu 
560 265 270 

Infill Arg lie Ar<i Lys His Phe Leu ri« Ly* Arg Thr Val Glu Leu lie 
2'j±> 12J10 

Phe- Leu Ser Phe Leu lie Ala AIa l«u Val Gin Gin Tip Val Val Pro 

295 

Thr- Val Ai-g AfcJl Ser Met Lyw Pro T/^y Se-r Glu Mt-I. Glu Tyr Ser PufQ 

3D5 310 315 iaD 

Cy£i Leu Glkl Ar<| L&U Leu Lyfc Lfcu Al« lie Pt'u AtfJ Leu lie Trp 

32£> 33D 33i 

Leu Leu P&e Phe Tyr Thr Phe Pii«; His sez Pkj* Leu Aen Leu He Ala 
340 345 3S0 

Glu Leu Leu Ar*| Phe Ala A£ip Arg <$1m Ph<t Tyr Airg Aiip Phe- Trp AfiA 

355 :sftn 

Ala Glu Thr He Glv Tyr PJie Tzp Ly^ £<±* Tzp Ai;ji lie Pro val ]Jia 
370 3Vb 

Arg Phe Ala Val Art? Hie He Tyr Ser Pro Met Met Arg Aen Asn Phe 
3U5 iOD 395 £00 

Ser Lys Met Ser Ala Phe The val Val Phe PJ\e val Ser Ala Phe Phe 
405 41t> 415 

I Lie Glu Tyr Leu Val Der Val Pro Leu Lyt^ Zl* Pn* Arg Leu Trp Ser 
420 ^25 630 

Tyr Tyr Gly MeL Met Gly Glu Ilci Prci f.r.y Sr?r Tic- Il« Thr Asp Lys 
43-5 4*° 4 * s 

val val Ar*j Gly Gly Arg Thr Gly Asn II v i:ic V* l Trp ser Leu 

dSO £55 4^0 

Hi? V*J- Gly Cil O Px<» r.rjsii Alfl He Leu Met Tyr Oly tti& A*p Ttt? Tyr 

470 450 



He Leu Aen Phe Glv val JJer Ala Val GIji As;/i Glu Thi Val Gly He 
485 l^S- 



PCT/US99/12541 



<2iQ> rj 

<2±2> IOTA 

<2l'J> CaenorhsbditiB elegane 

tttgatatqt acqataaarjT gaaaaaaast attcatstat [iscGCKjgtgg criataaa.tg8 60 

cactaaatat (ytttcaaang tatgasrcaaa eg c ntatgog ns^g^g;*^ -acitaagaaas. 120 

cgacctcitaa racatqaaaa atatcnatsg gaattttgag ntcsctt tt^ac iiacatacaat 180 

ataatgatta taataaaaaa cttsrtcttaa ciactagaaaa ctgetantte aaaaaaaaca 340 

aattgagata aatcaaacac caacsmtttq gttttoaact Ejccgaamcac caaagctcaa, 300 

<21<& 20 
<211> ifcl 
*212> DWft 

^213> Caenorhabdicist elegrms 



<:400> 20 

tEcsaatgcu 

tctttggctt 

ttcagaaatfj 

gacEjagttct 

actttcaaat 

ccttcagtsm 

tactctcatc 



ccagtngncg 
aaagga.cct t 
gciat^ncaa 
ggacsrcutgg 
acctcaacgt 
ct.at<rctcaa 



ggacoaegge 
ct cot cat tt 
gcaoLaaaaq z 
atttccatgpr 
cacttgaaan 
ttETttgaacD 

a 



rrccgtcagcc 
gcacaaaatrj 
ffBtac^tACt- 
attct tcant. 
t tgs tc«.ci?i 
rcactactcc* 



1 1 eggnaaac 
gt-nattcgt c 

t^t.rjgtrtt 1 1 
acttgs^gcr 



atctaatffet 60 
aaggaaaaaci 
crtttattttc 180 
tactttteot 240 
tgrttancacc 300 
ggccaaatcc 3CO 
381 



<21D> 21 

-:211> * 

<"212> PRT 

^2L3> Cc<nsejrve<3 Peptide ^eciuence 



tJ00> 21 

Met Ly& Al<s His. Ser ?Jtt= 
1 b 



<2L0> 22 
'.'2l2> DMA 

<3UU> 

<3D1> Cheng, 

-:303> J. Kiel. Chem. 
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<4O0> 22 

rredvdbrty Junastwdy<3 gfrva^yngVt. 
ggtyvvsyL* srrvmkahev rnv x-v-x>:»ak)c 
gcyvyyre£si s iiksarwcw &gut:Hhr;wnP 
wyyyaykdwij kr keaetmavfii vaawfiy^^u 



vvwl:wuunxl.:; vyJjWi Lgysk SfiJli-iihgingv fill 
ynhvtvriyyfl l.y rd:;ymL:y r-wgyv^tflJcev 120 
^cnrg-tSrmykcl wwnjsl::iy £j)y yrfcwnvvvbd 1B0 
^sywmgrrviTi i-kiewjw inwhngngv^y 240 

250 



<212> PRT 1 



<30l> Uelrnsn, 

^3G3=> J. Biol. chew. 

*3Cf4* 27 D 

OOV> 1995 



<400<> 23 

Arg Gin Sex Leu Leu AMp CV) u T.gu Ph* Olu Vfll A.'.sp Mis Tin Ai-fj Thr 

lie Tyr His M&t Phe II* AIh T.c*u I.ru Tl.« Lim PJje? VaI t-L:u fl*;r Thr 

20 25 SO 

lie Vil Vai A<:p Ty.T- Ti« Clu -<31y Arg [.mi l.«u ^ Phi? Asn 

i5 40 45 



Leu Leu Alii Tyr Alrt Fho Cly bys fhe Pro Thr Vfll T l.<? Trp Thr Txp 
5D 53 60 



Trp AIjs Mt±L. Phe L«u Scr Thr beu tfer lie l B ro iyr i J h^ L",*u L»h© Gin 
65 TO 75 BO 

Pro Trn Ala I Lie Gly Tyr Sor Lys Ser Scr Hi* Ptq t.cij Tic Tyr S«r 
R5 Pn 95 



Leu Val Hiss Gly 

Phi? V«l Pro Thr 
115 



Ijc*p Sic*-^ Pbe beu 



■IVx Val Val Leu 

X20 



Val L>h* 01 n Lpu 
105 

Ala Tyr Thr beu 



Gly V?il. Leu Gly 
110 

Pro Pro Ala iSer 
12 S 



IS 



WO PCTOJS99rt2«1 



A*3 L>he He He f.eu GLu Gin II* Ar 5 Leu JfleL ?.ys Ala Hi* 

"0 135 j4 0 

S«r fhe V*l Arc, CJlu A*n lie Pro Ajx) Viil ^eu ftsi) Aid Ala uy, 
" 5 !W 15H 160 

Ly« sex- l.ys Asp Pro Leu i-ro TJ.t.- Vol Asci 0.1 n Tvx- Leu Tyr Phe- 

16s 175 

Le., ?he Ala Pro Thr I.gu il„ <ty x Aiy Asp ash Tyr pil> Axct Thr Pro 
160 1H5 190 

Thr val Azrg Trp Gly Tyx v»l. ft la Met Gin Pho Leu Gin Val Phe Uly 

2DD 205 

Cy* Leu PL« lyr Val Tyx Tyr- He Ph<r Glu A*g Leu Cy* Ala Leu 

210 a a .s 220 

PhR Arg A*/. 71© Ly* Oln Glu rro Pho £!e r Alo Axq V d l Leu V«l Leu 
2:15 «0 235 2< iO 

Cy* Val Flu* fcsn iiftr Tie Leu 1'ro Gly Val Leu lie L.™ J'he I.r;u Sexr 
245 250 255 

Phe Phe Ala Phe Leu His Cy fl -rrp Leu asii Al* ?he Ai« ylu Mtrt Leu 
260 26 5 270 

Aig Phe Gly Asp Aig Mot Phc: -iyr Ly» Aep T*,> Tn> Asn Sex Thr Ser 
275 2S0 ^ 

Tyr Ser Asu -Tyr Tyr At^c, Trp A.-». Vnl Val Vol Mi:-. A 3 p Tx-^ Leu 

290 21D5 30 {i 

Tyr Tyr IVt- Vol Tyr L yB Asp Leu Trp Phe Ph* Ser Lys A*g i-he 

30b *15 320 

Lya Ser Ai« Ala Mel: Leu Al* Val Pti* Ala I.^u 3*r Ala Val Val His 

330 1J5 

Glu Tyr Mh Leu Al« He cy* Leu fi*r -ivx Ph* -jvx Pjto val r.n*u vh« 
340 **5 350 

Val Leu Pho Met PJh* Phe my Me* Ala Phft Asn Pho Zle V*J A &n 
3S5 3CU 3 G5 

S*^ ftrij i. y5 Arg Pro H« T r p A ± n JU Mlit . Val Tj7p Ala S(?r Le(j ^ 
37D 37D :i S|? 
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r.*u Uly Tyr rtly Leu lie J-tjy Cys Phe Ty** fi<?r Uln Glu Tup Tyr Aia 

395 3110 .19^ 4uu 

Arg Gin Mis <*.yn. (*ro Leu Lyri Asn Pro 
405 



■c210> 24 

•:2H> 429 

<212> PRT 

< 2 113 j- y«^5.f. 

<300> 

<303> a. Ki.cO . Chem. 
<304> 271 
<306> 24DS7-341G3 
O07> 199fi 

<400> 24 

Asp Ly& Aid Asp Ala Pro Pro Gly Glu T.yg Leu Glu £<?r ft&n Ph* £ter 
1 b 10 lb 

Oly lie oyi- Veil The Al* Trp Met PIik 1.<tv Oly Trp Ilti Ala lie flrg 
20 25 JO 

Oys eye- Thr asp Tyx Tyr Ala tier Tyr fily Sex Ala Tip fcr^n Lye l&u 
.15 40 45 

Clu He VjiI Gin Tyx Mc.r. Thr Thr Anp Leu Phe TJir 11* Al* MeL Ij?u 
bO 55 6G 

Asp Leu Ala wet PJi<t r.r*u L'ye Thr Phc> Phe Val VaJ Rhe Val lU:; Trp 
6£ 70 7b SO 

Leu Val T.yst Ly b AiLf Tie He Aati Trp Lvb Trp TJlt Oly Phe Vr±J Aia 

3D 9^ 

Val He Phe Gin Leu. Ala Ph* lie Pro Val Thr Ptie Px-o 'jyr 

lOO 105 110 

Val -Tyr lyr Phe liy^i L J he Aen Trp val Thr Ary Tie fhe Leu Phe Leu 
115 320 12S 

Hits S€>r Val VmJ Phe Val Mot Lys 5er Hiss .^pr Phe Alw Phe IVx Asji 
1J0 1 14U 

Cly Tyr L*,hi Tri> Aep Tie Lys Gin fily Leu Olu Tyr .^e: ilex Lya filn 

,17 
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145 15CJ 155 160 

Lnn Gin T,ys Tyr Lyis CUu Snr J.eu Ser L'^O Old 'J'hr Arg Clu 1 IP Leu 
lf>5> 17f) 175 

fijn ].y^ So/- <":ys, A=izi Pint <":y^ f.gn j^he Ulu lieu Asn 'iyr ^ln 'lUr by 5 
130 185 190 

Asp fcr*n Asp Fhn Pr« fr.zri Ar>n 11^ Ser <Jys Ser Asn Phe Phs Met The 

195 200 205 

Cyr; F*ni Ph* Pro Vt*l l^-u Val Tyr <iln lie Asn Tyr l J ro Arjj Thr £5er 

210 215 22D 

Aj-g 71 Arg 'prp Ai-g Tyr Veil Leu Glu bye val L'ys Ala He Il-e Gly 

725 230 235 

?hr lit: Pho r.r-u Me:K Hot. Vn 1 'rtir Ala Gin The ^be Wet Hi& Pro Val 
245 250 2B5 

Alu Mr-t Ar<j Cyr? "H ^ 0\n rhe His Asn Thr Pro Thr Phe Gly Gly Tip 
2fitl 265 270 

Hit Pm Al« Ttir f;l n 01 u J'rp Phe Mis Leu Leu Phe Asp tt£rt lis Pro 

275 260 2&S 

CJly Phw Tin ].ni> Tyr l>eu Thr Phe Tyr Met lie Trp Asp Ala 

290 295 300 

Tji^i T^i fc^n fiys V« 1 Ala. ^lu L*u Thr Ara Fht Ala Asp Arsr Tyr Phe 

305 >10 315 320 

Tyr t>ly kr-p Trp Trp A&n tvs Val tier Phe Glu GLu Pbe D-^r Arci He 
33> 330 Jib 

Trp A 5ii V*l Pre? Vnl Hie Uys PJie Leu Leu Arg >lie Val Tyr LI is St*i 
340 :545 350 

yex Ket <aly Ala Leu Hi Si £j£u J?i«- l.yji £>nr Gin Ai« Trir L*mj Phe; 'Pbr 

Phe Phe Leu L>er Ala Va3 Pfrfc JIju 01 ia Mrtl: Alu Mftu P>i<: Ala Tit; Fhc 
370 iHO 

Arq Arg VaL Ar<T <?ly Ty* Phr* M «t R % .n RIts T^imj Sor Clr: Pho Vnl 

3S5 iV>n 195 100 

T^i> TJ\r Ala LhMj Sen: A?n -Thr L-ys Phfi Lew Arg Aln Arg Pro Gin Leu 

lft 
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410 



415 



Sor ^r?Ti VhI Val Phe £Jer Phe Uly VsO Cy* 5*r Gly Pro 





2!> 


'-211* 


«2 




PRT 


*213^ 




<3QO> 






Viang . 






<K>4> 


27 2 






^n?> 




*4Q0> 





Civ Thr VhI VAl Thr Val Glu tfhr Thr i-1* He Sor Sc-sr Attn Phft S<ir 
IS 10 15 

Gly Leu Tyr V*l Ala Ph* Trp Met Ala He Ala Fh* 0)y M<» VhI T,ys 

25 -™ 

Alt? [.tNi H« Asp Tyr Tyr Tyr tsln His- Asn OLy ©nr Plus Lyw A*£j Shi 

40 

Gin Tlci L«u Ly& Phe Met l^hr Thr Aan Leu Ph* Thr VhI Ai* Scsj- Val 

50 ™ 

A^p Leu Leu Hat Tyr Leu Der T^hr 'JVr ("he Vol Vrtl <"ily llw GLu Ty = 
65 70 "J 5 

T^cni Cyn Lyii Tip Gly Val Lew Lys Trip Gly Thr Thr &ly Tt-p J J Ptus 

as 90 95 

Thr fUsr I .Ik Tyi Glli Phe Leu The Vol II© Pho Mcjtt Tyz- Lko TCi/; 

idd 105 im 

Glu A&n T]c t**u Lys Lifsu Hie Tip Lea iier bys II* Phe Lou Phc Lou 
115 1^0 125 

His Ser Leu Val l-tni Uiu Mnt. Ljm M«L JJie Ger t>he Ala Phe IV r Asn 

Gly Tyr Deu Trp Gly lie l/y.^Glu 61" Gin Phe Sejr Lys Uer Ala 

US 150 155 160 

L9 
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PCT/US99rt254t 



Leu £la J.y* Tyr Lys A.w Sex- I lo: Awn Asp Pro Ly b Vnl Tic GJy Ale 
165 1V0 3-75 

beu 01 u. Ly* $r>r Cy-: Gin Pho Cys £r>r Phe Glu Leu tier Cln £er 

IRQ IflS 190 

Leu Ser A*p 0"»n Thr Gin l.y:= Phr? Prn Aaaj Aen lie Ser Al*» Lys Sex 
195 200 205 

l*he Phe Trp Fhrs T^ 1 " Ph« *m Thr Leu lie Tyr «La lie Ola iyr 

210 215 220 

Pro Arc l^ir Lys. 0]« Jl* Arg fn? ^er ly* VhL ^ u G3 - u L ve lie eye 
225 230 235 2*0 

Ala lie i>he Gly Thr Tie P-h* W« Wr?fc lie Aep Ala Cln lie Leu 

245 2?>0 255 

Met Tyr Frn> V^l AIh Mnt. Arg Al^ f^u M« Val Arci Aen £er Glu Trp 
2&0 2G5 270 

Thr Gly lie Lev Anp Arg r.rr.i> Leu Cyr> TVi* Vtil Gly Leu Leu Val Asi> 
275 2B0 265 

lie Van Pro Oly P^t= JJ<-> V»] Mi?t Ty^ II* Leu Aep Phe 'ivr Leu lie 
?90 295 300 

Trp Asp Ala ¥le 1-cv Asn C:ys v*l OT« L<=« Thr Ai<i Phe Gly A bp 

305 310 HIS 320 

Ara "jyr Phe Tyr fHy Ar?p TVp Trp Asn f-V^- Ser Tip Ala Asp L*he 

325 330 33S 

Ser Arc lie Trp Asn lie L*ro V«l Hns- l.yr, CTiti Leu Leu Axg- His val 
340 345 350 

Tyr Bis 5er £Jer Ket Ser Eier Phe Lys Lew Asn T-yn Sc:r Gin A Lis Thr 
155 360 

Lea Met tfbr Phe Phe Leu Ser Ser Val Val His Olu Lou Al* M«L: Tyr 
370 37b 380 

Val lie Phe Lye Lys Lea Ar*r Phe Tvr Leu Phe. ^he i j hF? Ciln ifctfc. Leu 

"390 395 4uu 

Gin Met Pro Leu Val Ala Leu Thr Aen Thr Lys Phe net Arg &st^ A i^g 
405 410 4.1 5- 
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Thx 71s lie Oly ABT1 Val Ila PJm Trp 3.«u Gly rle Cye MeL Gly l>xo 
420 425 4 J D 



v.210> 26 

<?.)\y 29 

<212> DWA 

<2L.*¥> Artificial Sequence 
<22pj* 

<223.> Inscription at Arf.i Fi cial Sequence: ArabicJapr; j ^ 
ACA'J' Like prim** I or- V BACK 

<400-- 2G 

Hgnaajittoa CCraqfCacaoc; aticir.^ottC 29 

<?,1Q> 21 
c211> 28 
<212> .UNA 

<213> Artificial Gt-i^unnc^ 
<23Q^ 

■::22."5> Inscription ol Jix-r-.j f aciztl Sequence: Ai d t>i dopr; i 
AC AT Like paimtM- tar V KACE 

MHgijatQctt r.qagttc^l.9 rtiiantctga 2H 

<2in> 38 
<211> 31 
<21J>> DNA 

<213> Artificial Sem^ncc 

<V2!> Inscription iil Arti facial iieguencii : Resl- 5' Ff:R 
pr imer 

<aoc?> 2a 

*--Hggtgnca<y actcacK^ki*. coacca-ntcc <: 31 

<210* 2<J 
<2ll> 30 
<213> HNA 

<212j Ari-.if l.cial Sequent tt 



21 
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<2?.3> BttAc:r.-ipl:iOi) o£ Artificial Se=qt>6JJte: RaL ACAT-llK* 
cit::=LtMJ PCR pi a inner 



ogc<7# gctt- 5 n ac<5 t l L-da tg« t.t: r,g 

<21Q> 30 
<211> 3? 
*212> RKA 



.10 



<220> 

<-221> Dn^c^ipt: icijj ol Ax l.il J.^ipT 5aqu<?m;rii K^t- Ai:A7- 1 i V<? 
PCR prrljiiKir 

<:400> 30 

gg^^c^tgc ngglxAygiii: i:c:<:*<:tujggg catWJtn > 7 

<210> 31 
<211-> 41 
<2).2> DfNA 

<2L3v ArhiiitiiaL £tKJUtfnc:^ 
<22D> 

<223> nc»£<:i\iptitiJ> fcJL Ax. I. J. I i i:i « 1 .Sc^?ir.^r.:o: ^at. ACTAT-lik^ 
FCR piijutjr - 3 r 

gg^b^r^gr^gg r:c:ge:*<:£i*H:g ggcTgaoogeg *J3g - gng<~.gg'g n- 41 
<21D> 13 
<2L2> DNA 

<22D> 

<22.1> nc^ci"ii>t.i«ri of Artificial sequence :C elegans 
AC AT- 1 .i Jcrc FKTR pr i rrn?r- 5- ' 

<JDD> 33 

gg£il:<:r;gc:yg c:i lycj.-iciart l.g- r:g t r?fl.ciC^ail cgggctcgilCEJ g 41 

^210^ 33 
^21L:' 3S 
<:212> DNA 

<213> Artificial Sequence- 

22 
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<220> 

<223> Description of Artificial s^uence: C. el£r«an& 
ACAT- Like PCR priw&r-:3' 

■c400> 33 

smatcccctg caggtcaaat accaaL^i:?. t.yyiitttg 

<c21D> 34 
<2I1> 3* 
-z212> DNA 

■c213> Artificial Sequence 

*223> Description of Artifir;irtl Sr^i^cj^: Arfibi<3op5i& 5 r 
pen primer 

tcgacrcsrca ggaacicttag a*aL$y<:g«i- fcl.LggsLLc 

<2LD> 35 
*2ll> 3b 
<r212> ENA 

r-.^l3> Artificial sequence 
<22Q> 

<223> Description df Artificial Sequence: Ar*bidopei& 
ACAT- like 1" PCR primer 

3b 

ggatcccjcy^ ecgctcatga C£ L. Cy » fc. r: nr f.hhc:gg 

^21 D> 36 
<2ll> 
<2L2> DNA 

<2Li> Axtif:i^i.»] SRrjufinco 
<22Q> 

<233> Inscription of Artificial Secraence; cloning 
Adapter 

cgcg-atttaa s:tggc<Tc<rcc ctqreaggcgei ccgcctgcaci ggcgcgccat tt 

<21D* 3? 
*211> 33 
<212> DNA 

■:213> Artificial sequent 
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*220> 

<223> Description ui Arti i icial SEH]iii*m:t- : pOGNKSia 
Cloning' O li Lemuel eo Lid** 

<AO0> 17 

tcgaccatCC <JOEJEJOc:gCrttt yi"rl.l.<:<:tgr%« gg 32 

a:21D> IB 

<211=> 32 

<:212^ DMA 

*c213^ Artificial .Scqurnr^ 
<22D> 

^=223=* Description of Arfci.ficAwl Atexsuence: pOQHflfclS 
Cloning 0 1 i.g^mj r:l r.<i fc. icls 

clD0> 16 

tcgacctgca CEJ^GCt tgr? ggc^g^sgctt cc 32 
*c212=- DWft 

<2L3^ Artificial Sf^icnct: 
c220> 

<223> L*es crj.pt. i<hti of Artrlt icial Sequence : pOONBfcl P 
Cloning Oi -igpT>in;i ootid.-^ 

<400> 39 

tc^awtgca ggaagcttgo ggccgcOTat cc 32 

<210> 40 
*211> 32 
*212> DtiA 

■c213> Artificial S^QUcnc* 
<220> 

^22 3> Description of Artificial iJeciuence? pCGN8<51S* 
Cloning Oligonucleotide 

tcgagnatcc gcaErccgcotf gottcctgc-a *tb 32 
<210> 41 
iC21^> DMA 

<213> ArLi.lit:ial Sequence 
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<220> 

<221> Description of S&qisence: pCTUN8G20 

01 onlTUT 0 1 i goni J nl K" I- id* 

•c400> 41 

tcgaSFSJEltCC gcsgccgraa gcrkh^l-.guH yya^CL 

<2i0> 42 
<r211> 28 
<212> DWA 

<213> Artificial Sogufw;^ 
<220> 

<223> Description of Arti v\ M .Socpicsnite: pCGNB620 
Cloninc Oligonvcloot 

-c400> 42 

cctDcaggan gctt.gcss^ gi?gg»tcc 

*210> 43 
<21'1> 3G 
<212> DMA 

<213> Artificial £oc r ic<nr^ 
<22D> 

01 on ing Ci 1 igiw»u i-l c?«fc i «?w 
<100> 43 

tcgacctBM ncaagsttDC ggccgcGsat ccagrt; 

c2il0» 44 

<:211> 2B 

*212> WUA 

<213> Artiiicial Sequence 
<220> 

<223> Description of Artificial Sequenr-n: ]jCGN«£2l 
Cloning Oligonucleotide 

ggatccfrcDfl ccgcaagc-tt cctqoagcs 

<2Ll> :so 

<213> Artificial sequence 

as 
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<223> Description nt" Aiti iLczL al S^que-AC-e : Cloning 
Adapter 

caggaggc-gs ccg<:flgy«yy eLe^aggtfAt: id 

*210> 4G 
<21L> 31 
<212> D(WA 

<220> 

*223> DFt&Ciriptirxn ljI fir Li litis 1. Rciigumice: Cl Dnin-g 
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